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modified DNA polymerase Polymerases and to a purified modified gene encoding a 

nucleotide base (either guanine (G). adenTe (A) t^'S m ! ^ 'P^'^'^ S-Ven 

fragments are each separated on the bSs of L r !. ^' °' "^"^ '^"^ ^'««ran« sets of 
band on the gel corresponds cot 'arlylo : speci e t^ e^ ' T ^ ^^^'^^'^^ Qel: each 

positions in the sequence of the given nVrotS base ' ''^-"«*y'"9 the 

Invol^SrchlTcaTC^^^^ °- ^^^^^ ^ ^'^^ 

defined terminus, each reaction yiellg almlted 11 ^ ' """'^'^ ^'^ ^ ^^'I'o'abel at its 
A, T or C). The other method (didety sepu^^^^^^^^^^ ^ °' 'o- bases (G. 

separate syntheses are run. each reacS bSa ^^^^^^^^^ <^ - ^^A strand. F^ur 

incorporation of the appropriate chan terminiSn Hinf ® ^ "^^^ A. T or C) via 

the DNA fragments are uniformly TbeCSSd orZTiTn; ^'l "^"'"'^ ^^^'^^^ 
contain increasingly more radioactivity FurtC's^^^^^^^^^ '=^9^^ DNA fragments 

labelled nucleotides, resulfing in sharper dSon- at^Jh! ^ ''^ "^^^ P'^^^ ^P" 

each lane corresponds only lo either G A it C '"'«'P^«t since 

Escherichia coH DNA-polymerase I large fragment (^JenownVl" ^"^"^'"9 the 

transcriptase. ^ 'ragment ( Klenow ). Another polymerase used is AMV reverse 



25 Summary of the Invention 



met "odToSs^^^riS f rdLTS'wh- ' "^^^ -*ch 

merase which has sufficient DNA polylrase Sivt for^^ '"^^^^^ ' '"""""'^ °NA poly- 

.0 combined with any cofactor necessaryT^rDNrnoTvl '"'"'"'^'"^ ^"^^^ ^^'^ P^'V-^^^se is 
modification of a naturally occuring genT r^dified in tt ^""^ the 

exonuclease domain of said naturally ocurrina dna 1 °' """'^ ^^'d^ the 3' - 5' 

that naturally occuring at the site U bStion t^TeSLV^ ^" ^'-^^ 

. °Tr;;r~;;:-^^^^^^^^ -rally^rN^^^^^^^^^^ - o, naturally 

TT-type DNA polyrra^ :Spo Se to rem '1 ^ -^--'^ -«^'«ed 

conditions normally used for DNA sequenJn^SSil^^^^^^^ h k"? *° ''^^ ""^^r 

for use in DNA sequencing when said Sa^rc^mSL J '2 ' r""""""* "'"'^ P^'^'"^^^^^ ^^^^ty 
polymerase activity and which results fr^rmod i iSbn of. "^'^^^^^^ '^'^ DNA 

the activity of naturally occurring 3-- 5' e)^onuHpjl . °^'"'""9 -modified to reduce 

wherein one or more lino acids of the Tnud a^e doS °' '^V^*"-"^ ^NA polymerase 

polymerase of said naturally occurring DT polvmerar n? T "^'"^ '""^'"^ °* "^^ DNA 
DNA polymerases, are repLced b^ an a^,iJo ^Th ?h I ~^'«^P°"*"9 domain of other T7-type 
substitution or are deleted. ''^ ^ ^^'d other than that naturally occurring at the site of 

the corresponding domain of other TT^prDNnoTlr^l "'^'^ Po'^'nerase. or 

that naturally occurring at the site of substSution t S^deler* 

. encoSeStr "el^^^^^^^^^^^ -nfion the modified polymerase 

normally used for DNA sequencing beforrdi sociaZ mr„ 7 J"^' '^"'"^ ""^^^ 
Poymerase activity of the modified polymerle ifat Sargirorfh^ 7,^' '•'^ ''"^^^ 
polymerase: the gene encodes a modified 001^!™! ^ u °' "^•"'^''V ocu^ng DNA 

infected with a T7-type phage (ilXe in STe ^^^^^^ 

' subunit) for example, the T7-type phage fe T^^ ^3 *, Z 2 l^'TIT '^"''^^ thioredoxin as a 
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•sz "arc.T~:~Sr ^^^^^^ - -~ 

DNA Polymerase are replaced or deleted ^ ''^^ °' "^^^^'^ °«="^in9 T7 

. Pri.r i^:rrrpt™?„T™^ non-dlscri.l„a«„g. and can utilize short 

above described metLs. andl parculHr dSa ""^^^^ P'^P^^'^^ the 

radioactivity in the polyacy lamidrgSt n^jl a Lrir!^ T"^ '"^^ background level of 

sharp - making the 6na sequence easyt C Furthr "° f ^"'^ '^^^^ ^« 
sequencing long DNA fragments, as is desUe^tn derb^to^^^^^ ' ''"'^ '"^"^"^^ °^ 

preJed js:^:;:^ ttsjfirsr ^'^^^^-^ '""^ '^"-'-^ ^--'p«- 

DgscrjEtjon of ttig Preferred Embodiments 
20 The drawings will first briefly be described. 
Drawings 

DNA Polymerase 

' cloernrdSi^^^^^^^ P--i- has no associated exonuclease activity. 

as tetramers. hexamers and ortleX ScTorZf °«9°n"Cleotides (such 

«amers} as specific pnmers. These properties will now be discussed in detail. 

Processivitv 

using ^rSnlprer'eT^^^^^^^^ T *° '"^P-'^ -ny nucleotides 

DNA sequencing^xtens^: rel:,rs t^^^^^^^ ""^^^ -"^^'^-^ —I. y used for 

incorporate only a few bases before dis?ociat3e o ^Icl""" Polymerases: some 

10 bases). T5 DNA polymerase ^ou 180 S^^^^ ^^"^"^ ^® DNA polymerase (about 

Biol. Chem. 254:122%TBamba?a Vt 3? J^lrcZ^ll iT':'^' "^^^^ =1. J. 

present inveTSiSn. will remain bouTd for at ,eaS soo^l! ^" ^? ^'"^ 
suitable environmental conditions. Sucrenvi onmemal tlnn, T T'^'" ^'""^ "^^^ 

four deoxynucleoside triphosphates and a^ZTt^ 'l^f ' ^ ' ^ ""P"""' °' 

enhanced in the presence of 'e. co^ sin^rsS^b^rg^'S^^^^^ 

haveTl~ cZT^rgl^r s^urasTJj"^^""" ^' ^hk^h 

processive. then artifacfu.l H.n^:Z^ <«aeoxynucleotide. If the DNA polymerase is non- 

nucleotide where the polymerase d s^Slf^ P^ sequencing reactions, at positions corresponding to the 
incorrect positions and?t! cures "her^^^^^^ 'J^'^" « background of bLs at 

the reaction mixture for a long time (30^0 mJ^S r^' k ' " ^^"^ '^"^^ "y incubating 

artifactual bands up to a high m^i^S^lrh, I ^'"9^"'"'^"t'««°n <" substrates, which "chase" the 

sequence is read This rnoTrSaTiorsf^^^ '"'^ '^"^ ^^S'"" ^'^'^ ^^^A 

an ideal solution since a non-processive DNA polymerase has a high 
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5 Analog discrimation 

same as that of a normal nucleotide o lt\^^ t^—rih ""T" 1"" approximately the 

'0 Of a nomial analog. Tl,e polymerais ofthktln^^^ I ^^"^ ^* ^'^0 «"i<=iency that 
other analogs. This is imp^r^^n, ih7e in a^Z rnZT '° «9^'"«t 
dATP. d-rfp and dCTPrSuencino rfr^^^ 

denvauves such as: SI iSyrt^,^^^^^^^^^ '""^r^^""" °' ^'^^ «yP«« nucleotide 

synthesized strands with -s. -P orl^rXS^^^^^^^^ "'"'"^ '^^""9 

s against analogs the same probaWirv^nrex27or iTl . ' f""" ^^'""^^'^ ^oes not discriminate 
For labelled nucleoside triphosphates Is is iloll. ! " °' ^"""^ ^ ^ 
strands using a minimum of ralSirF'rtl^er'C LtS T'"''"'' ^''^ 
"'"Krr ^7 reaction cheaper^th«' IXrilVn 

At such impediments therwillT^aZi v ' '^^""''^^ -P«d'"'«nt. 

Which may obscure the sequence ' ' '"'""'"^ °' ^^^'"^'^"^^ "ands on the gel. 
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.ess^^rrtrntr^r^^^^^^^ - -n t.. and most preferably 

polymerase molecule). By normal or LrainLoStt T"^^^^^^ 

unmodified TT-type polymerase NormalL fh! ^ f ! " ""^^"^ ^^"^ exonuclease activity of 

. activity per mg of poi;m;ra;e m« r:r! !™ ^^"^'^^ 5.000 units of 

exonuclease 

activities are detrimental to the oualitv of nwA c^-, template. Such associated exonuclease 

concentration of nucleotide p^cu^so^fi ^^^^^^^^ ^""'^ "'^^ '^^"-d 

concentration falls, the polymerase actM v sZJ ro . f ' "'^ ""C'eo«de 

resu^-nginnonetDNAsUero^erC^ro t;^^^^^^^^^^ ^<^'>"ty. 
te Jr=r.^Sr^^^^^^ ^ to id. . reg^ns In the 

synthesis decreases as^ it struggles triT^^sociatSr^ ''"'^''"^ " ^^^'^'^^^ '^'^ °* 
DNA When the polymerase staSs. Is a ^Seq^enTnTrS^^^ "^^'^ 
This may result in the polymerase even^lirnr. T^^^ °* ^"^^^^'^ ^^'^^^^ occur. 

Of the sequencing reaction) or the ooC^I^TT''^ ^^"^ "° ^ the quality 

artifactual band a^ the sam^ ^sTl'taTZ sZn~ ^-^^"^ - 

incorporated at a high frequency and produce a wWe vSaS in L ' f ""^r ™'"^*"9 «9ent may be 
sequencing gel. This happens because the freourc! If n ^ T °* '^^9"'«"ts in a 

given Site increases witti the number of oZT % °' « chain terminating agent at any 

terminating nucleotide. aJd so the dna p C^^^^^^^ '° incorporate me chj 

higher frequency at sites of idling than at o2 siles ' ' chain-terminating agent at a much 

esse^i^t'or^CtSe^SS ll^T: 'Ti:' TT •^-"^"^ ™^ 

variable intensity of radioactive blS artier ht^' T '^^S-^^"*- there is 

uniform radioactive intensrty of a! f^agmrnte ^e DMA ' "'."""^ undetected. To obtain 

at each position on the DnI, show X p^^ce^t^^^^^^^ h'^ " *^ ^^-"^ °' 

given Site. This occurs if the DNA ,^ lymerai lZ 1 l^n^ °' ""'^'^♦''^^^ ^* 

one oppo^nity to incorporate a ch.^T tZSnrnucSdrre^^^^^^^ ^ r;-" 
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advantages to DNA sequencino ^f^hl^ ^ "^'"^^ P"""^'^ « """"^er of important 
usual ^Lo^er XTS,^^^ f *° to synthesize th^ the 

the sequencing reaction faster FuZ ?hL fh^^L^ complementary sites on a DNA template, thus making 
Primers for DNA sequendnrperm^ "^^^^^^ °' ""^"^ °«9on"cleotide 

For example, a kit^ntainl aoT^Vor^^^^ '"^""""'"^ D'^'^ 

to any sites in the cloning vt^oVTSi^ one oTf S h!"'''*'"' °' complementary 
every 50 bases along the DNA fragrTent to £ seo. °L*h «° "^f ,"'«^««'^"«"<=es will occur an average of 
would require only five seauTnc ^^1! p ^"^^ determination of a sequence of 3000 bases 

sequence's' GTA^^S^cmryLl'^^ ^ h?"" - ^"^'^"^ -^^^ll, 

insert. Using the results from *f Lq enc o l^^n '^^^^ ^» °' •'^^ 

homologous to a region near the e^doXdet^rn^n^^^^^^^^ ' new pnmer would be picked from the kit 
next 600 bases would be d«ei inf Z n '"T"' ^ 

determine the complete sequenZ the aSbSe^ °' '"^^^ ^°"'d 

Chemical synthesis of any "390 ucSe o^^ S "T"'f '"^ ^""^ ^'"'°"t ^he 

inc,ud.g gene 2.5 and ^^roj T^^^ZZ^Z^:' ^^^^^ 

polyrasL Th^TLVNA^^^^^^^^^^^ ''-P^^-) '-"^e modified T7-type 

fally identical to a ^r^^'r^mSr^^^^^^^ ^ ' ^"^ ^''^^ ^"^^»-- 

such as T3. ^1. ^11. H. W31. gh-7 J SPB ^^rf:! '''"^ '^^^'^ P^-9e, 

properties similar to those of the modSied T7 en-- » ""^'"^ '° ^'^^^ 

infected cells directly but DreferLJ^^I l ^ l * " ""''^'^ *° ''^^ «"^y'"« from phage 

tially identical is mlVthat ^ en^^^^^^^ " isolated from cells which overproduce it. By substan- 

properties of the enzyme. One examl of a oaZlTH T''"''"' ''''''' '° 
the natural enzyme is modified to Sve anv Inn T ' ""'^ substitution is one in which 

the genetic or chemical leve (see bZ) """"'y- "^'^ -^edification may be performed at 
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andl"ofTDSrpllZi::r™s"^^^^^^^^^^ ''^ overproduction, purification, modification 

a one to one stoichiome,rS i he phaoeT en'^^^^^^^ °' '"^ Polypeptides tightly complexed in 
150 J. Biol. Chem. SSIS.igrsrthe oLr is ,hP F^^^ h ^ '''^^^^^ °^ ^'^^ ('^^'rich et al. 

J. Biol. Chem. 262-16 21^1987; Th^th w ^' ^ ^""^^"^ thioredoxin, of 12.000 daltons (Tabor et al 

(the non-proceJe actual orpll'^T/^^^^^ " '"^^''^ 9^"- ^ protei 

very active 3- to 5' exonuclease alocSriT T^^^^^^^^ P"'^"'^^^^ ^ 

sequencing and must be inactlated Tm^^ ..h k J polymerase useless for DNA 

performed, as described beC SLrcSSv , ^'T''''^ ™^ ^««dily 

genetically, by modifying the codC^orCiyr^^^^^^^^ ^ 
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s^irr^rCeni^a^dr^^^^^^^^ r ^-'^"^ 

Thioredoxin and Glutarprin Vn Systeml^rurtum fnH^ » ? ""^ ^ al.. in 

Press. NY: and Tab^ rir^ su^ri he ItaatlTn^^ 285-300, Raven 

plated onto trxA" ceils, and IhStlo n rVc"''''*"' ^ ♦''^ ""'rture 

tn<A gene oTiTy those phages cS^ ^ ^'^^ 

located on a 470 base pair Hinci^fraZnt " "^^ ^ genes-wi?i 

CentT^Cthrrgte w^^^^^^^ ^'-V- 470 base pair 

Uboratory Manual, Cold SDrinJlHa*„r i »h! ^ t procedure {(\4aniatis et al.. Cloning: A 

pBR322Ltaining a Rac prZter /^^^^ *° ^ <^^"-a«v° 

12. containing ^-la'ctamase aTS w^ut 1 "^nt" Z'''^- '^^^ Pt^- 

then ligated to two tandem copies o 3 '^i^^i^l ^ ' ^'^"^ ^ °' ^290 bp. which was 

aem copies of trxA (Hmcll fragment) using commercially available linkers (Smal- 
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5 pGPM and mGPl-2 



polymerase. Gene 5 protein is lethal 101111 Si Sains In '".'"""^ °^ ^ "'^'^ 

'0 by Tabor et al. 82 Proc. Nat. Acad. Sci To74 fl S ll °' " ^^^^-^ described 

*10 promoter, and is or,.y expressed w^en^^ RN?LtmlZ ' "^'^ °' 
Briefly dGP5-5 (Fin \\ II 7 ? ! ' polymerase is present in the cell. 

5667 (Hindi) to 6166 (FnuWcon ^TTranfriP '° '"^"^^"^ °' ^ 

'5 T7 RNA polymerase. a'Sdthe 3kb eZSi HinH f * '^^ Promoters, which are recognized by 

1141. 1978). The nucleotide sequencB"^r?7 n^^^^^^^^^^^ ^""'^"^ ^ ^^^t-J 

5 is only expressed when T7 r3a ^ylZJJs .T,^^^^^^^^^ ^^'^ ^'--'^ 9ene 

^0 polymerase wal igated u ^9 '^LrsS andYa. '^^^^ Tf"V° »' ^ «NA 

polymerase gene under con'ol oT hX proSerT^:!'^^^^ 

shown in Fig. 9. — The complete nucleotide sequence of mGP1-2 is 

.eyrbftr^^trd^ ^^^^^^^^^ ~n <respec«.ely a PISA and a ColEI origin, 

26 presence of IPTG. mGP1-2 can coeiTSe s2^e T ' °' t^e 

expression of T7-DNA polymerase S TG^S^ Tr^rL c P'^^"^'^^ be used to regulate 

inducing the lac promoter with. e.g.. IPTG ^ ^ ^ ""^ production of T7-RNA polymerase by 

Overproduction of T7 DNA polymerase 

30 

strategy the two genes are oo^!ZS^Tt^\ZleTZ'S ''^^'"^ 
proteases until thioredoxin is bound to it.) As descrLTL df « ^"'^ ^'"^ ' 

35 genes separately on each of two compat^bte pSSs In « «' T ^'^"^"'^ *° ^^^'^ '^'o 
be placed in tandem on the same P^ It^Tmr^^^^^^ Alternatively, the two genes could 

a non-leaky inducible promoter, such as « TaTTb * h fin ^'^^"^ ' " ^"^"^ '''^ ^""t^"' °' 
quantities of the two polypeptides together is toxic in mnT, V ■ 7' synthesis of even small 

than 500 molecules of the gene prS^Ie oSu.^T. "^"''"""^ "-^^"^ ^^s 

^ promoter, controlling the gene's exZs7on rno, L, \'^! o ^^"^ '^-^ 9^"^ *^hen the 

system Is used although ott^er expTessS;^^^^^^^^^ ^ "'^'^ P^'^--^^^ expression 

leaky promoter, e.g.. ^lac. allows mr^hrMTmnllp r ! P^*"^"*"^^ "^ed. A 

induced, thus cells containing leLTgenrs unde^ I con;^;, , ''°k" '° ^^^^ 
suitable in this invention. It ,s of courrLssfbt To nTn 1'"'' ^ ^^^^ P°°^'y are not 

« lethal, for example, the plac promoter "suSfL sucE cet" "'"^ "'^^ 

ce.ls"c:nSrg^:tdSt r;^^^^^^^ r^-^-V- ^^-ng this strategy, the 

Which point the two polypeptides form ^actite' n^C-y "^^^^^^ *° ''"""'"^ ^'^^^^^^ 

50 B^mptel: Production Of 

A Mnste colon, is gro™ owoijS'o 25 « »Ld JSa IT^,'? '^""""^ 
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arge culture. 4 X 500 ml 2X LB Is inoculated with 4 X 5 ml 71.18 cells grown in M9. and is shaken at 37-C 
When the large culture of cells has grown to an A590 = 1.0 (approximately three hours), they are inoculated 
with 10 ml of supematant containing the starter lysate of mGP1-2. The infected cells are then grown 
overnight at 37-C. The next day. the cells are removed by centrifugation. and the supernatant is ready to 

5 use for induction of K38/pGP5-5/pTDc-2 (see below). The supematant can be stored at 4-C for approxi- 
mately six months, at a titer -5 X 10" */ml. At this titer. 1 L of phage will infect 12 liters of cells at an 
A590 -5 with a multiplicity of Infection of 15. If the titer is low. the mGP1-2 phage can be concentrated from 
the supematant by dissolving NaCI (60 gm/liter) and PEG-6000 (65 gm/Iiter) in the supematant. allowing the 
mixture to settle at O'C for 1-72 hours, and then centrifuging (7000 rpm for 20 min). The precipitate, which 

10 contains the mGPI -2 phage, is resuspended in approximately 1/20th of the original volume of M9 media 

K38/pGP5-5/pTrx-2 is the E. coH strain (genotype HfrC (X)) containing the two compatible plasmids 
PQP5-5 and pTrx-2. pGP5-5 plasmid has a PISA origin of replication and expresses the kanamycin (Km) 
resistance gene. pTrx-2 has a ColEI origin of replication and expresses the ampicillin (Ap) resistance gene 
■? ^l^^rt^ introduced into K38 by standard procedures, selecting Km" and Ap" respectively The 

15 cells K38/pGP5-5/pTrx-2 are stored in 50% glycerol at -80 'C. Prior to use they are streaked on a plate 
containing SOug/ml ampicillin and kanamycin. grown at 37-C overnight, and a single colony grown in 10 ml 
LB media containing 50ug/ml ampicillin and kanamycin. at 37-C for 4-6 hours. The 10 ml cell culture is 
used to inoculate 500 ml of LB media containing 50ug/ml ampicillin and kanamycin and shaken at 37-C 
ovemight. The following day. the 500 ml culture is used to inoculate 12 liters of 2X LB-KPO4 media (2% 

20 Bacto-Tryptone, 1% yeast extract, 0.5% NaCI. 20 mM KPO4. 0.2% dextrose, and 0.2% casamino acids. pH 
7^4^^. and grown with aeration in a femientor at 37-C. When the cells reach an A590 =5.0 (i.e. logarithmic or 
stationary phase cells), they are infected with mGP1-2 at a multiplicity of infection of 10. and IPTG is added 
(final concentration 0.5mM). The IPTG induces production of thioredoxin and the T7 RNA polymerase in 
. u"®"''® production of the cloned DNA polymerase. The cells are grown for an 

25 additional 2.5 hours with stimng and aeration, and then harvested. The cell pellet is resuspended in 1 5 L 

200 ml 10% sucrose/20 mM Tris-HCI. pH 8/1.0 mM EDTA. and frozen in liquid N^. From 12 liters of 
induced cells 70 gm of cell paste are obtained containing approximately 700 mg gene 5 protein and 100 
mg thioredoxin. 

30 K38/pTrx-2 (K38 containing pTrx-2 alone) overproduces thioredoxin. and it is added as a "booster" to 
extracts of K38/pGP5-5/pTrx-2 to insure that thioredoxin is in excess over gene 5 protein at the outset of the 
purification. The K38/pTrx-2 cells are stored in 50% glycerol at -80 -C. Prior to use they are streaked on a 
plate containing 50 ug/ml ampicillin. grown at 37-C for 24 hours, and a single colony grown at 37-C 
ovemight in 25 ml LB media containing 50 ug/ml ampicillin. The 25 ml culture is used to inoculate 2 L of 2X 
35 LB media and shaken at 37 • C. When the cells reach an Asso = 3.0. the ptac promoter, and thus thioredoxin 
production, is induced by the addition of IPTG (final concentration 0.5 mM). The cells are grown with 
?n oM pn^V'^t^ ^«S"sP«"ded in 600 ml 10% sucrose/20 mM Tris- 

TWcMr. u » I^V^""^ '^'^P""- '"'"^"y- ^"^ resuspended in 40 ml 10% sucrose/20 mM 

rns-HCI. pH 8/0.5 mM EDTA. and frozen In liquid N2. From 2L of cells 16 gm of cell paste are obtained 
40 containing 150 mg Of thioredoxin. 

Assays for the polymerase involve the use of single-stranded calf thymus DNA (6mM) as a substrate 
M ou ''/T'^ '^mediately prior to use by denaturation of double-stranded calf thymus DNA with 50 mM 
NaOH at 20 -C for 15 min.. followed by neutralization with HCI. Any purified DNA can be used as a 
template for the polymerase assay, although preferably it will have a length greater than 1.000 bases 
* . ,0.-^1''^'^ ^^'^ polymerase assay used is a modification of the procedure described by Grippo 
T lii! u ;c ^'^"^^'^ '^i'' (200 ul final volume) contains 40 mM 

ins/HOl PH 7.5. 10 mM MgCb. 5 mM dithiothreitol. 100 nmol alkali-denatured calf thymus DNA 0 3 dGTP 
dATP. dCTP and I3H]dTTP (20 cpm/pm). 50 ug/ml BSA. and varying amounts of T7 DNA polymerase' 
Incubation is at 37-C (10-C-45-C) for 30 min (5 min-60 min). The reaction is stopped by the addrtion of 3 
50 ml of cold (0- C) 1 N HCI-0.1 M pyrophosphate. Acid-insoluble radioactivity is determined by the procedure 
of Hinkle et al. (250 J. Biol. Chem. 5523. 1974). The DNA is precipitated on ice for 15 min (5 min-12 hr) 
m^r^T.'^M u!f.i°»° 9'3ss-fiber filters by filtration. The filters are washed five times with 4 ml of cold 
(0 C) 0. M HCI-0.1 M pyrophosphate, and twice with cold (0-C) 90% ethanol. After drying, the radioactivity 
on the filters is counted using a non-aqueous scintillation fluor. 
is One unit of polymerase activity catalyzes the incorporation of 10 nmol of total nucleotide into an acid- 
soluble fornr. in 30 min at 37-C. under the conditions given above. Native T7 DNA polymerase and modified 

Inrv^^o^n'"^'^® "^'"'"^ ^® ^P®"*"^ polymerase activity t 20%. which ranges between 

5.000-20.000 units/mg for native and 5.000-50.000 units/mg for modified polymerase) depending upon the 
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preparation, using the standard assay condrtions stated above 

Atter 45 min at 0-C the S am'nTr J °' '^^^'^^ mg/ml) and 10 ml of NaCI (5M) are added, 

cells are then lo^eri ^l.T^TS^^^^^ TTT '''"^'^ '''^ 

37.C water bath. After thawing, the cells arJenSy li^^^^^^^^ S ^j*' ^ 

one hr at 35.000 rom in a RAr^irmon >irT" ! ^ ®° "^'"* ^® 'y^^*® »s centrifuged for 

IS allowed to sit for 60 min and the re<!iiitinn nr^..:r^:*o♦^ .... J suspension 
The precipitate is -dissolv:? ir/cJ ^ ^20^'^^^^^^^^ 

EDTA/10% glycerol (Buffer A). This is fraction II 2-mercaptoethanol/0.1 mM 

" n is ^CdVvLr^raS^^^^^^^^ :/irr ^^^^^^ ^- 

conductivity equal to that of BuSir TlZHa IOoLm m J.'n''. '=°"^"^-«y Fraction II has a 

a. a flow rate Of 100 ml/hr. ^d w^hed ^^^^^^^^^ Bulfi a 'T- " " ''""'^ 

with a 3.5 L gradient from 100 to 400 mM N»r? J « f containing 100 NaCI. Proteins are eluted 

. T. ONA polyLase. wrhZfs T.Z'.'^a, Tjo^^S '^Jt^C^:^"' ^""""'"^ 
KPot r^5irre:caX^^^^^^^^^ cml i:r;arrnrwL^^^ with .0 mM 

(380 ml) with Buffer B thlTplTto tie f ^^"""^ '^^^'^«°" ^'"'^ 2-fold 

Buffer B containing lOOmM K^Ce s e^el^^^^^ ^^"^^^ -"'^ ^00 ml of 

5 B at a flow rate of 60 ml/hr Fractions contLnlnn^ nwA f ^ * 100 to 400 mM KCI in Buffer 
This is fraction IV (370 mi). ^ polymerase which elutes at 300 KCI. are pooled. 

A column of DEAE-Sephadex A-50 (4 q rm2 v m 
7.0/0.1 mM dithiothreitol/0.1 mM EDtLiJ% ZerVlff^^^ nT7 .w' ^"'^ '"^ ^"^"^^d 

of 1 L Buffer C to a final conductivity equaUo ha! o K r I °" " ""'^""'^ "9"'"^* '^'^^"S^^ 

A column of BioRad HTP hydroxvlaoatite M Q rm2 v i \ • 

PH 7.4/10 mM 2-mercaptoethanol/2 rMTitateTooI g yS^ (Sr^rF^r V ifd' f T 

two changes of 500 ml Buffer D each n\^^^,:^^ri fr^^t:^. a/ • , . ^" ^ is dialyzed against 

and washed with 100 ml o7Buffer D P o e ns Ir^Id 'witTa Sn'" f ""J""" ^ '''' °' ^° 

PH 7.4 in Buffer 0 at a flow rate of 30 miSr ? ^ ^^"^ 0 to 180 mM KPO4. 

zzi.- ™- ^ 

SUM »«, 100% oxygen. TWs jvcl. is^l'.Zn "i? ° M subssquenlly 
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c9rn,,r.oZZZ:e Z^^^^ °- ^'^l ^H-abe.ed doub.e-stranded DNA (6 

3H-labeled single-sLded ^ oTTfreYarZ IZ'T. ^966). 
stranded 3H-.abeled T7 DNA wilh soTm Sit 27?^ , 5 '^^^^^ m,"V^ °' 

5 The standard exonuclease assay used is a mSm^tinn .mI T' neutralization with HCI. 

The standard reaction mixtureTiM ul rl^, T ^ "^^^^"^ ^hase et al. (supra) 

mM dithlothreltol. 60 nmoPH laLel i^^^^^ "f^"^' ''^ '° '"^ -^9^° 

polymerase. 3H-labeled double-str^dS^ ?na 1 ? J '"^ "^^'"^ °' ^7 DNA 

radioactively labeled DNA. sing e oJluWeS^^ndeS t "''h T ' ""''"-'V 

'0 single-or double-stranded DNA can be used for r "'^ "^^^y- 3' end labeled 

is stopped by the addition of 30 ul ^B^A Mm^l After .ncuba,ion at 37-0 for 15 min. the reaction 
at 10-C-45-C for 1-60 min. The dnaT;"^.": ^"' '^^Z' °' ^^'^ ^^)- "Hie assay can be run 
12.000 g for 30 min (5 min - 3 hr) im u J I ' ^entrifuged at 

~byadding!..o400u^:irdlir;:e=^^^^^^^^ ~^ 
unde^ir^irrri^Nr - - «de in 30 min 
units/mg. using the standard asC^nd^orstatTaZ ^'''''f 

modified T7 DNA polymerase depends ut»rthe eSlnt of^hl f !f " ^^^^^^'^^^^ 3c«^»y of the 
100-fold lower than that of native Ty DNrpoyrnSt or smT n '''^ 
20 assay conditions stated above. When double Sdlrft:, hi. ?^ """'""^ "^'"9 ^^"^^^'l 

fold higher. """'^ ^^^^'^ ^""^^^te is used the exonuclease activity is about 7- 

eightTo^for ^daT^i^^^^^^^^ TT ^ ^^"-'"^ ^ecay of 

reaction Will proceed'moreCdlyTrsrctir^^^ ""^^^en. otherwise 'he 

^5 2.5 mM DTT (0.3 ml of a fresh 250 mM sSS a^O mu! concentrahon of oxygen is higher. Once per day 
After eight hours, the exonucirse TcJitv J ?^ S^^^ 
negligible loss of polymerase act^ The 5^" losFmav hi"??"""^""." '^'^ ^^^"^^^ '0%. with 
exonuclease activity of half the polymerase molIr L 'tn °' "^""^'^'^ '"^^''^^'i"" of 

exonuclease activity in all the moSrSus a^^^^^^^^ " ' °' ^^'^ °' 

0 polymerase activity, half the molecules have n:rmrr„ ^" '""'^'^"'^^ ""^-^al 

of their original exonuclease act!vS exonuclease activity, while the other half have <0.1 % 

^^^^'^^U^O^^^^^^^ -yme is suitable, ..ough 

_ ~ greater than 0.. medication ^at^sr ^.S^ 

mMl■r;^S;^^:^l~ ^.'^-^^ ^=0 mi of 20 mM KPO. pH 7.4/0.1 

mg/ml) is stored at -20 • C. *° "^^ ^« '""'^'"^d T7 DNA polymerase (-4 

oxyg?n%;^rg~y^telt^^^^^^^^^ ^^^'^ ^'^-^ ^^^^^^ reactive 

possible reac«on mechanism flthe^g^rlfCrylS^^^^^^^ - -V^en. A 

(1) Fe2+ + O2 - Fe3+ + O2 



<5 (2) 20i + 2H^-.H202 + O2 

(3) Fe2+ + HaOz - FE^^ + OH' + OH" 
fn equation 1 



50 



55 



can -ca-. ■ -e superoxide radical 

can react with reduced metal ions toTrrhUrx^^TdiSrSH- 1^^^ "^^'^^ 
ox.d«ed metal ion is recycled to the reduced form hJ ^ 7 ^ ^'^"a^O" 3)- The 

These reacUve oxygen specirDrobaWv Tn.^. . ° '"^'^ ^ dithiothreitol (DTT). 
amino acid residues. Such d^na J is It^^^^^^ S ""T? ^'^'"'^^"^ ^"^^'"^ ^^-"'^ 
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35 



40 



45 



SO 



55 



Stimulated three-fold bv use nf inno/ t-l. 

transition metals due to , re presTnlTt^l T""" '^^^"^ °' 

preparations. ^ ^^"""^ '^^^ ''"^"""^^ °' '^^"sition metals (l-auM) in most buffer 

. che.?or:TrEDTr^^^^^ -aerobic condi«ons (e.g.. N.) and meta. 

may inf,ibit tt,e reaction, consiste t Sh^^^ ^.T'' "'^ '"'^^^''^^ '^'^"'"'^^ 

modified T7 DNA polymerase ° ""^^^ ^f^es in the generation of 

10 ONA sewncino wimout Ihe nLdTVZnl:..?^ """^ ^ ""^"^ » "sal In 

then screeSg f^rl^e mutani^ .'StTarL?; ^"'^""'^ "'"'^^^'-S 9-e 5 and 

Mutagenesis is performed as '0 0^^^^^^ f °' Po'v-rase activity. 

'5 plasmid containing an origin for s nale S.h ^ '^"'^"""^ ^ene 5 (e.g.. cloned in pEMBL-8. a 

promoter is prepared by sta dL^pt J^rf^^^^^^^^ ~ °' ^ ^ ^^^^ Po'v-erase 
which .ill mutate C's and Ps. and formTc t d "^0^X^.0 G'strA-' m'""" 

1985. The DNA is mutagenized at a dose which rS,! ^ ®* ^ Science 242. 

molecule. The single-sided r^XnTaed Pl^^^^^^^^^^ T '^'"^ '^'^^-'^ 

» above), and used as templates to syrthes^lne T 5 '^'^ 

randomly incorporated bases at Lj^Bonst^^^^^^^^ ^''^ synthesized strands contain 

"abnormal". This is the case the anZ of dATP 2' '"^^'r'*^' ^'"^'^ ''^ ^^"^osnized as 
DNA polymerase in place of d^TP However in tha T"""^ ^' ^"^'"^ incorporated by T7 

' polymerase, it is excised as rapid^T as ~co;^^^^^^^ '° «''°""'^'«ase of T7 DNA 

figure 6. using the alternating Ciymel poly dX ^^1^ "° '"'^ ^^"^'^^^'^^ 

extensive DNA synthesis only in tS presenc^of dATP ' w '"'^ polymerase catalyzes 

polymerase, because of its lircTi, TssS^ f not etheno-dATP. In contrast, modified T7 DNA 
a rate comparable to dATP l^ur us^^l Z^ 

precursors, native T7 DNA poWase is uniLT^ as a template, and dTTP and etheno^ATP as 
polymerase which has IoAs^'^Z^'^^ J^^^I^TT '""^ '^"^P'^'^' ^NA 

The procedure for lysing a^rsc'eenTna tl n ^ f *° "'^ ^^"'^^^'^ DNA. 

Acad. Sci. 2274. 1975). BrSly theTa^Gp^^^^^^^^^^^ f "^'T' ^^^^""^ ^^^^^^ Proc. Nat. 
Plasmids are transferred from t^e pm^rh the^ .h mutagenized gene 5-containing 

plate, to a piece of filter paper (-rXp2a"T S 'T"' approximately 200 colonies per 
to induce the T7 RNA polymerase whth S ?^ ^" ''^P^' ^^''^^^ P'^^^ ^t 42-C for 60 min 

produced from the chrC^l^gl^ s^;),^^^^^^^^^^ ™« constitutive, 

freeze thaw step to ensure cell lyste the S o^er diS! I '° '^'^ ^fter a 

etheno-dATP at 37-0 for 60 m?' STe L To^? '"'".'"''^ ^^^'^'^ I-^P^TTP and. 

unincorporated FpjdATP. DNA will prrcfpitaS on rfiir' ^""'^ ^'''^ ^^''^ »° ^^'"^-^ <he 

The washed filter paper is then usL Hx^se X-rav ^ tV"' ""'^'^"""^^ 

polymerase which is deficient in its exonucTase wi hVl r^'"' "^"^ '"^""^"^ ^ ^'^"^^ DNA 

by autoradiography. Colonies e^prZ^g t^: ^ DnI "Ze^ acid-insoluble -P. and will be visible 
def^ve in polymerase ac«vity. will notUar^rrararo^;^^^^ " "^""'"^ ^ ^ ^'^'^ 

color."Sllsrr:ain^^^^^^^^^ ^ Petri dish confining the original 

DNA polymerase Purified from eacrp eP^^^^^^^ T""^ ' "^'^ (°"e '"e^ anS T7 

mut^t. Those low in exonuc,eas:lTpCrforDrCeS °' """'^"^ ^'"^ ^^^^ 

^^Tx;:::^^ — - - ™ — of .e T7 

distinguish^ ^l";re^DrSvrrrr^n^^^ ^ •«'^'"--> - ^iso be 

St-., .us. w. modi^ed zs=. 
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presence of 0.5ug/m Shidtrbm^^^^^^^ "f"*"' ^el electrophoresis run in the 

^^..a.o.tLth:~^^^ 

exonlirrZ^o^fc^el'd?^^^^^^^^ *° -'-^^ ^''^ 9ene 5 

Dunn et al.. 166 rSZ BTl^''','^'t7L^T '''' 

mutated are also shown. ^"'^ sequences of the region of gene 5 
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Wild type sequence 

109* (aa) 



c^" S ctf ?cc S i?c ?S S iL* S '^^^ 

14677 .(T7 bp) "° CCC GGA AAA CGC TTT GGG TCT CAC OCT TTG GAG 



Mutation 1: His 123 Ser 123 

Priaer used; 5- CGC TTT GGA TCC Z£= GCT TTG 3' 

Mutant sequence: 

Mutation 2: Deletion of Ser 122 and Hi, 123 

Priner U3ed: 5 « GGA AAA CGC TTT GGC GCC TTG GAG GCG 3* 

A 

^ base deletion 

Mutant sequence: 

c~ J:-? iSS - - - - =^ -^^ 
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Mutation 3: Ser 122, Hiz 123 Ala 122, Glu 123 
Primer uied: 5' CGC TTT GGG fiCT fiAfi GCT TTG G 3 • 
Mutant sequence: 

CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG 1^ GCT WG (Sg 



Mutation 4: Lys 118, Arg 119 Glu 118, Glu 119 

Primer used: 5' 5* G CCC GGfi £AA QhSk TTT GGG TCT CAC GC 3« 

Mutant sequence: 

118 119 

ieu Xeu Arg Ser Gly Xys Leu Pro GJy SJji ^ Phe Gly Ser Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGfi fiAA 2^ TTT GGG TCT CAC GCT TTG GAG 

Mutation 5:. Arg 111, Ser 112, Lys 114 Glu 111, Ala 112, Glu 114 
Primer used : 5' G GGT CTT CTG fi^ QCC GGC fiAG TTG CCC GG 3 ' 

Mutant sequence: 

111 112 114 

Leu Leu QJji ^Jj^Gly SJji Leu Pro Gly Lys Arg Phe Gly Ser His Ala Leu 
Glu 

CTT CTG £M fiCC GGC £AG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 
Mutation 6: His 59, His 62 Ser 59, Ser 62 

Primer used: 5' ATT GTG TTC AAC GGfi ICC AAG TAT GAC G 3* 

Wild-type sequence: 

aa: 55 59 62 

Leu Xie Val Phe His Asn Gly His Lys Tyr Asp Val 
CTT ATT GTG TTC CAC AAC GGT CAC AAG TAT GAC GTT 
T7 bp: 14515 

Mutant sequence: 

59 62 
Leu Xie Vai Phe S^^r Asn Gly Ser Lys Tyr Asp Val 
CTT ATT GTG TTC ZQC AAC GG^ ICC AAG TAT GAC GTT 
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Mutation 7: His 82 Ser 82 

Prijner used: 5* GAG TTC ICC CTT CCT CG 3« 

Wild-type sequence: 



aa: 77 g2 

Leu Ajfl Airg Giu Phe His Leu Pro Arg Giu Asn 
TTG AAC CGA GAG TTC CAC CTT CCT COT Sg Hc 



T7 bp: 14581 
Mutant sequence: 



82 

Xeu Asn Arg Glu Phe £cx Leu Pro Arg Glu Asn 
TTG AAC CGA GAG TTC ICC rTT CCT CGT GAG AAC 



Mutation 8: Arg 96, His 99 -> Leu 96, Ser 99 
Primer used: 5' Clfi TTG ATT ICT TCC AAC CTC ' 3' 
Wild-type sequence: 



aa: 93 96 99 

Vai Xeu Ser Arg Lev lie His Ser Asn Leu Lys Asp Thr Asd 

T7 bp, ?i?2r^ Sc 

Mutant sequence: 

96 99 

Val Leu Ser Isa Leu lie ^ Ser Asn Leu Lys Asp Thr Asd 

GTG TTG TCA Clfi TTG ATT 2CT TCC AAC CTC AAG GAC ACC GAT 



Mutation 9: His 190 Ser 190 

Primer used: 5' CT GAC AAA ICT TAC TTC CCT 3' 
Wild-type sequence: 



aa: 185 190 

Leu Leu Ser Asp Lys His Tyr Phe Pro Pro Glu 
CTA CTC TCT GAC AAA CAT TAC TTC CCT CCT GAG 
T7 bp: 14905 

Mutant sequence: 

190 

Xreu Leu Ser Asp Lys £^ Tyr Phe Pro Pro Glu 
CTA CTC TCT GAC AAA XCT TAC TTC CCT CCT GAG 
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Mutation 10: His 218 Ser 218 



Primer used: 5* GAC ATT GAA ICT CGT GCT GC 3* 
Wild-type sequence: 



aa: 214 218 

Vai Asp He Glu Mis Ar^ Aia Aia Tip Leu X^eu 
GTT GAC ATT GAA CAT CGT GCT GCA TGG CTG CTC 
10 T7 bp: 14992 



75 



Mutant sequence: 

218 

Vai Asp He Glu -ggg Arg Ala Aia Trp Leu Leu 
GTT GAC ATT GAA ICT CGT GCT GCA TGG CTG CTC 



20 



Mutation ir: Deletion of aniino acids 118 to 123 

Pri^r used: 5- C GGC AAG TTG CCC GGfi GCT TTG GAG GCG TGG G 3' 

A 

18 base deletion 



Wnd-typc sequence: 
109 (aa) 

S S= S? J£ S S:a ^ SI Si iSa S '4: 

14677 (T7 bp) ^^'^ AAA CGC 7TT GGG TCT CAC GCT TTG GAG 



Mutant sequence 



117 



Leu Leu Ary Ser Cly Lys ieu P'o GJv , • 

« ooc' ^ ^ s ^'-t^.:::^':--^ 



Mutation 12: His 123 Glu 123 

Primer used: 5' GGG TCT SAfi GCT TTG G 3' 

Mutant sequence: 



e?" — "^^"^ ^-^^ ''^'^ 



123 



Giy 5er ^LLu Aia Leu Giu 



CG- -rr ( ' — 

CG. .CC Gv^ AAG TTG CCC CGA AAA CCC TTT GGG TCT GAG GCT TTG 



GAG 
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Muctlon 13 : (Aig 131. Lyj 136. Lys 140. Lys 144. Are 145 -» 
Clu 131, Cla 136. GIu 140. Glu 144. Glu 145) 



Pri»« u,«,: 5. GGT TAT CC GAG G« CAA TAC fi« GAC ,TT fiXC fii* AT= 



CTT GAA G 3 



10 



15 



Wild-type sequence: 

129(aa) 13X 135 

GJy Tyr Ary leu G2y Glu Het Lys Gly Glu Tyr Lyl A^o jur„ p. 
^3^!.?%^ ^ ^ 



20 



Mutant sequence: 



129 (a*) 131 

Gly 7yz Lau Gly Glu Met Slil Gly Glu Tvr GJu ^ »k 

GOT TAT £iS ^ ^ ^ L«u Glu Glu 

14737 {T7 bp) ^^'^ ^ ^ 7TT CAG ATG CTT GAA G 



25 



30 



follofrrhT"*^,?* ^^"^ ^ '"'^'^""'^ °' 'f^® -^"tant phage into K38/pGP1-2 as 

Extracts were partially purified by passage over a phosphoceilulose and DEAE A-50 column and 
aTsCrrSeT '"'"^^''^ ^'^"""^'^"^ ^'-"V. as described Ze^TT ruls 



35 



40 



Mutant 



Table 1 

SUMMARY OF EXONUCLZASE AND POL^nnyvcE 
ACTIVITIES OF T7 GEKE 5 mSa^^ 



Exonuclease 
activity. % 



Polymerase 
activity. \ 



45 



50 



[WDd-rypeJ 
Muou 1 

(IDs 123 Set 123) 
Mutant 2 

(A Scr 122, His 123) 
Mutants 

(Scr 122. His 123 -* Ala 122, Glu 123) 



[100]a 

10-25 
0.2-0.4 

<2 



[100]b 

>90 
>90 
>90 
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Table 1 

SUMMARY OF EXONUCLEASE AND POLYMERASE 
ACTIVITIES OF T7 GENE 5 MUTANTS 

Exonuclease Polymerase 
Mutant activity. \ activity. % 

Mmant4 

(Lysll8.Argll9->Glull8,Glull9) <30 >90 

Mutants 

(ArgllUScrll2, Lysll4-4 

Glu 111. Ala lii Glu 114) >75 >90 

15 Miitam6 

(His 59, His 62 -> Scr 59, Scr 62) >75 >90 

Mutant? 

(His 82 Scr 82) >75 >9o . 

20 

Mutant 8 

(Arg 96, His 99 -> Leu 96, Scr 99) >75 >90 

Mutant 9 

25 (His 190 -> Scr 190) >75 >9o 

Mutant 10 

(His218-*Scr218) >75 >9q 



30 
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45 b . 

DNA 



Mutant 11 
(ALysllS. Arg 119. Phc 120, 

Gly 121, Scr 122, His 123) <:0.02 >90 

Mutant 12 

(His 123 ^ Glu 123) <30 >90 

Mutant 13 

(Arg 131, Lys 136, Lys 140, Lys 144, Arg 145 

Glu 131. Glu 136, Glu 140, Glu 144. Glu 145) <30 >9o 

"""^Qof'!!^® activi-y vas measured on single stranded (3u]x7 
100% exonuclease activity corresponds to 5,000 un^.s/^;g 

""^^Sr^'f 'f''''^^ was measured using single-stranded calf t-ynus 
100% polymerase activity corresponds to 8,000 units/r" 



Of the seven histidines tested, only one (His 123: mutant 1) has the enzymatic activities characteristic 
50 of modified T7 DNA polymerase. T7 gene 5 protein was purified from this mutant using DEAE-cellulose, 
phosphocellulose, DEAE-Sephadex and hydroxylapatite chromatography. While the polymerase activity was 
nearly normal (>90% the level of the native enzyme), the exonuclease activity was reduced 4 to 10-fold. 

A variant of this mutant was constructed in which both His 123 and Ser 122 were deleted. The gene 5 
protein purified from this mutant has a 200-500 fold lower exonuclease activity, again with retention of 
55 >90% of the polymerase activity. 

These data strongly suggest that His 123 lies in the active site of the exonuclease domain of T7 gene 5 
protein. Furthermore, it is likely that the His 123 is in fact the residue t)eing modified by the oxidation 
involving iron, oxygen and a reducing agent, since such oxidation has been shown to modify histidine 
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residues in other proteins (Levine. J. Biol. Chem. 258: 11823. 1983; and Hodgson et al. BicK:hemistry 14: 
5294, 1975). The level of residual exonuclease in mutant 11 is comparable to the levels obtainable by 
chemical modification. 

Although mutations at His residues are described, mutations at nearby sites or even at distant sites may 
also produce mutant enzymes suitable in this invention, e.g.. lys and arg (mutants 4 and 15). Similariy. 
although mutations in some His residues have little effect on exonuclease activity that does not necessarily 
indicate that mutations near these residues will not affect exonuclease activity. 

Mutations which are especially effective include those having deletions of 2 or more amino acids, preferably 
6-8. for example, near the His-123 region. Other mutations should reduce exonuclease activity further or 
completely. 

As an example of the use of these mutant strains the following is illustrative. A pGP5-6 (mutation 11)- 
contalning strain has been deposited with the ATCC (see below). The strain is grown as described above 
and induced as described in Taber et al. J. Biol. Chem. 262:16212 (1987). K38/pTrx-2 cells may be added 
to increase the yield of genetically modified T7 DNA polymerase. 

The above noted deposited strain also contains plasmid pGP1-2 which expresses T7 RNA polymerase 
This plasmid is described in Tabor et al.. Proc. Nat. Acad. Sci. USA 82:1074, 1985 and was deposited with 
the ATCC on March 22, 1985 and assigned the number 40.175. 

Referring to Fig. 10. pGP5-6 includes the following segments: 

1 . EcoRI-Sacl -Sma l-BamHI polylinker sequence from M13 mpIO (21bp). 

2. T7 bp 14309 to 16747, that contains the T7 gene 5. with the following modifications: 

T7 bp 14703 is changed from an A to a G, creating a Smal site. 
T7 bp 14304 to 14321 inclusive are deleted (18 bp). 

3. Sall-Pstl-Hindlll polylinker sequence from Ml 3 mp 10 (15 bp) 

4. pBR322 bp 29 (Hindlll site) to pBR322 bp 375 (BamHl site). 

5. T7 bp 22855 to T7 bp 22927. that contains the T7 RNA Polymerase promoter <^»10. with BamHl linkers 
inserted at each end (82 bp). 

6. pBR322 bp 375 (BamHl site) to pBR322 bp 4361 (EcoRI site). 

DNA Sequencing Using Modified T7-typg DNA Polymerase 

DNA synthesis reactions using modified T7-type DNA polymerase result in chain-terminated fragments 
of uniform radioactive intensity, throughout the range of several bases to thousands of bases in length 
There is virtually no background due to terminations at sites independent of chain terminating agent 
incorporation (i.e. at pause sites or secondary structure impediments). 

Sequencing reactions using modified T7-type DNA polymerase consist of a pulse and chase. By pulse 
IS meant that a short labelled DNA fragment is synthesized; by chase is meant that the short fragment is 
lengthened until a chain terminating agent is incorporated. The rationale for each step differs from 
conventional DNA sequencing reactions. In the pulse, the reaction is incubated at 0-C-37-C for 0.5-4 min 
in the presence of high levels of three nucleotide triphosphates (e.g.. dGTP. dCTP and dTTP) and limiting 
levels of one other labelled, carrier-free, nucleotide triphosphate, e.g.. ps] dATP. Under these conditions 
the modified polymerase is unable to exhibit its processive character, and a population of radioactive 
fragments will be synthesized ranging in size from a few bases to several hundred bases. The purpose of 
the pulse is to radioactively label each primer, incorporating maximal radioactivity while using minimal 
levels of radioactive nucleotides. In this example, two conditions in the pulse reaction (low temperature, e.g 
from 0-20 -C. and limiting levels of dATP. e.g., from 0.1 uM to luM) prevent the modified T7-type DNA 
polymerase from exhibiting its processive character. Other essential environmental components of the 
mixture will have similar effects, e.g. limiting more than one nucleotide triphosphate or increasing the ionic 
strength of the reaction. If the primer is already labelled (e.g., by kinasing) no pulse step is required. 

In the chase, the reaction is incubated at 45-C for 1-30 min in the presence of high levels {50-500uM) 
of all four deoxynucieoside triphosphates and limiting levels {1-50uM) of any one of the four chain 
terminating agents, e.g.. dideoxynucleoside triphosphates, such that DNA synthesis is terminated after an 
average of 50-600 bases. The purpose of the chase is to extend each radioactively labeled primer under 
conditions of processive DNA synthesis, terminating each extension exclusively at correct sites in four 
separate reactions using each of the four dideoxynucleoside triphosphates. Two conditions of the chase 
(high temperature, e.g.. from 30-50 -Q) and high levels (above 50uM) of all four deoxynucieoside 
tnphosphates) allow the modified T7-type DNA polymerase to exhibit its processive character for tens of 
thousands of bases: thus the same FX)lymerase molecule will synthesize from the primer-template until a 
dideoxynucleotide is incorporated. At a chase temperature of 45-C synthesis occurs at >700 
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45 



50 



55 



.evels Of .riphosphates'.h;oughoutTeC^^^^^ " '^'^'^ - 

. c.-n,^-^ can .e use. in ONA se..en- 

primer. then no pulse step is required " fluorescent analog is at the 5' end of the 

the pulse reaction time, the longer thHriml Z '"J^ t f o °J ♦^"^'"^«"9 agents, the longer 
w nucleotides/sec. Second is the ratfo Tr,^ i e'rtens.ons. At O'C the polymerase extensions average 10 
Chase reaction. A ^^Z ^Ze O^^^^^^^ ''''°T''' '^^'"'"^^^^ 

these analogs, thus the average lengTof' eZ " in 1".° 

ynucleoside triphosphate concentration to the S tLminlt " °' ^''^ '^«°'<- 

Thus. in order to shorten the average s ze ol ^e eJSn? 7.'^'? concentration in the chase reaction. 
'5 and/or the ratio of chain terminaS aoTn! tn I ^'^'"f P"'«« shortened, e.g.. to 30 sec. 

Chase reaction. This cant. Z^JT'^^T'^'''' triphosphate concentration is raised in the 
lowering the concentration of deoxCdeosir 1^^^^^^ °' agent or 

extensions, the pulse time is increasereTTs^ 2^^^^^^ '«"9th of the 

is lowered (e.g.. from 20uM to 2«M) in th^chl rlacL o' chain terminating agent 

Exarnp!e2:DNAsequ^ 

srrrje^j^ir r ^^^^^ -"-"^^ - ^ 

« is mixed with 1 m Of complementary sequencino / J P'°^edures) at 0.7 mM concentration 

and 2.5 ul 5X annealing b'u ffer (2^mM T^X^fso nliT^^^^^ ' ' ' 

slow cooled to room temperature over 30 rnin In tL n. I ? ^^''^ ^'^^^^'^ '° ^^'^ ^ min. and 
mixed with 1 m dithiothrL 0.1 mTu, orj^dNlS S^P Hr™ "J^' °' '"^^ ^""^^'^^ mix was 
TE). 2.5 m [.3ss]dATP. (1500 C.7mmo7 Net EnaUn?N ^^^^^ ^^^^ ^ '"^ each (P.L Biochemicals in 

0 described in Example 1 0.4 mg/mT 2500 un^t^^l '^T^'l^" ' °' ""^"^ ^ ^l^^ polymerase 
after vortexing and centri uging ^a nficlcTtT L V^- ' ' r""^ "'C' ^ min, 

min and temperature can vary^from o ' C to S c lZJ^ °' '^^'^ ^0 sec to 20 

from the primer. ^ ^ to 37 C. Longer times are used for determining sequences distant 

G. A. T. or C (P-L Biochemicals) The so^rifLh/' ^"'^""'^ ^'t^er dideoxy (dd) 

dATP imM. 0.5 «, 5X anneatg\:t%?rn.S T^^^^^^^ ^5 M n^' ^""^ ^""^^"^^"^ ^' 

UM (where ddNTP corresponds to ddGAT or r in Z. ' ""^ ^"^ ^ ^ l^' ^^NTP 100 

at 45.C (or 30 • (>50 • C) tor 7o min td then 6 ul of T '"f^^^" "^"^ ''^^^^^ 
xylenecyanol) is added to each tur'and the tut^ ol tk" '^"^"^ ^'^T^' "-^ % 

The sequencing reactions are L on sta^d^d fiof ! h'""^ '^""^ ^"3° 

Watts for 6 hours. Prior to running on a Sel ^e^Lt, P^'^t'"^ """^^ sequencing gel in 7M urea, at 30 
10% acetic acid. 10% methanol. drTon a ge^d^^^^^ 

autoradiography film overnight. °" ^ gel dryer, and exposed to Kodak 0M1 high-contrast 

&Me3:DNAse3uencinai^ 

deo:.:^ rho?p=;^^^^^^^ -9 -ting levels of al, four 

protocol in example 2. Rrst the Dulse tlZ^. . °^ ^ """"^^^ °' advantages over the 

necessary to interrupt a L L^^,TTZZ"1" ""'^'^ P~'°<^°' " w J 

method it is easier to control tSTx^enf of the e,S^^ r""""" '° Second, with this 

P JerXe" oTpXrei^jr 2^41^/ '^^'^ °'-P'-n»a^ ^equendng 

250 mM NaCI) heated at 65'cTl m^n and ""^ 50 mM MgCfe 

reaction 10 ul of the above annla^^m x" Tas m2 '? t«"'P«'^«"^e over 30 min. In the pulse 
dCTP. dTTP) 1.5 UM each asLTf^^sSAHar^^ M. 2 ul of 3 dNTPs (dGTP. 

lla ajOATP. (alOuM) (about lOuM. 1500 Ci/mmol. New England 
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75 



20 



30 



35 



1-60 min.. respectively.) ^ '^"^^^^^^^'^ and time of .ncubalion can be varied from 20-C-45-C and 

dTTP. d6TP) 200 uM each, a d 1 0 Si ddSV^so um £h T ' ^' '^'^^''^ <^'^T"P- ^^^TP. 
(or 20-C-45-C and 1-60 mrresDec iveM 1^^^ 

' EDTA. 0.05% xylene-cyanoradrd toT^^^^^^^^^ f (^5% formamide. 20 mM 

polyacrylamide sequencing ge. as descrb^, abo^e °" *° "^""'"^ °" « standard 

Exa!I2P!e4:Replaceme^ 

s.f rrn:~^^^^^^^^ secondary 

reactions are performTwiS, ^e ab Je de^^^^^^^^ T'^,, "'^"'"'^^ °' ^sb. Preferably tt,eL 

reason., or£2. in.. andrar2Vrtrrer^^^^ " °' "^^^ 

dGTP is used. ,n the Zo"":::: T^'.^ritz u^:r(n^t Zpr"" "° " ^^-^^ 

onuJLlL'TardZ ? - '-'^ (^''-t 0.01 units, o, ex- 
used. It is recommended t at t^eZr d^^^^^^^^^^ ""^^^^ ^ ^'^P 
Of, the four reactions using dGTP TboZc^^^t^^^^^^ T'' «'<'^'"S'-°" 
can be minimized by: (1) Lving dGTP a^d dlTrouf Jmh? I ' """"'"^ °' ^^'^"'^^^ '""'^s 
mixes can be used fo both dG?P and dlTP rLc^ fo ^ '^'"'^'^ ""^^"^ '^"^ ^hase 
dlTP (2m at 0.5 mM and 1-2 5 mM esP^^^^^^^^^^ ' -°"'=-tration of dGTP or 
each cont^n a low concentration of dX^^^^iS T "''^^^ 
or dITP. This modification does not usual radverselveffprZ' either dGTP 
reduces the required number of pulse and L^^^^ f °' sequencing reactions, and 

The sequencing reaction iJi o example 3 exiprthTt^'otfTe^ 

for dGTP- 1 5 uM dCTP http o«h ^ tl ^^^t; ^ ^ P"'^® "^'^©s contain a) 3 dNTP mix 

UM dATP.dTTP and dCTP and arl n^ZT Jl ^ ^^ase mixes contain 30 

dITP does not exceed 5 m^. d.deoxynucleotides at 8 uM). and the chase time using 



40 Deposits 



45 



50 



number 67571. ^" C>ecember 4, 1987. and assigned the 

die ^S^tlJ'erd'olt terror: r "r"""^ *° ^^'"^^^ ^►^-^ «^ey 

years, whichever is the longer and Js re oo^^^^^^^^^ T'l"""' ^ "^'t^^^- °^ 30 

patent, at which tim^U de^Jts w H ^^T ' °' '^'^^'^ °' ^"ch a 

deposits will be made IrevSraviwe 7'°^'''' '° P"'""^- ^"^^ ""'^ 

Section 1-14 and 35 USC sSS. 112 ' Co'^miss.oner of Patents under the terms of 37 CFR 



Claims 



55 



m.;:ifirgel''Srg'en^^ '"■^'"^'^ --P^ses expressing a 

polymerase ac^vrfofui i Tna sequ^ J^oTnTH °T '^'^"^^^ "'''^ 
necessary for s.d Po^merarj^ricrrerurr^e^^^^^^ 
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«y DNaI;? " ^^''^ 3' ■ 5' --"-^'^-^ d-ain of said 

the site of substitution or are deleted so as to reduce the activity of naturally occurrina 3' - 5- 
exonuclease activity of the naturally occurring DNA polymerase. ^ 

^ ~ — ~ — ^ r 

20 

» 9. A m,ft«d according to claim 7 cha,.cl,.isad in th., s., 122 and His ,23 .,, raplacad or delalM. 

«. A n,«h=d acc«.i„9 » claim 7 ch.™:,arta«, in M =„l„« .cid residua L,a „8 ,, Hb ,23 „, 

" '^^^'Z^ ~ - — 

"^^^^'^fT^ " "» '■■^ <« •"■> 0^ '« 0, 

naiurauy occurring T7 DNA polymerase are replaced or deleted 

40 

^ivmtrrl ^ ^ ^ ^ 5' exonuclease activity of the naturally occurrina DNA 

ST?7 HNA T" °' ^'"^ °' '"^^ exonuclease domain within the amino^eZnal haJ 
olh^ p'SS:™ ^"^^ corresponrgTrol 

the Site oftbsmulrrarrdTeted ' ' " ""^^ "^^^^"^ 

the^'eio^ctefi!? 9«"e according to claim 14 characterised in that one or more of the amino acids of 
Z I? .^"^ ^'""^ *° ^'"e «sidue 224 of T7 DNA p^ymeSe o 

said naturally occumng DNA polymerase, or the corresponding domain of other T7^rS DNA p^ly 



45 14. A 
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19. 



rSeJ' " '"^^ '''' ^--"9 at the site of substitution or 

'° °' ""^^ " ' *»»=.eH^ in tha, said gene .ncoae. a 

»5al«S.'™"""' '° " i" « Se, ,22 ,„„ ,23 a,e raplacM 

" ^=v^:z"T/"^/sraC^^^ ^" " - - - - 

^ ?r,1T.™^^?*f.r' " """" " in ftal A,9 ,3,, L,s ,36 Lys ,« 

,44 M A,9 ,45 o, m, nawal^, ocowdng T7 DNA polym.^ a,, replaid « ai? ' 

Patentanspruche 

'^'r.T;r:rJ^^^^ r"''^^" DNA-Po.y.arase. bei deo, ein modifi^iertas 

Polymerase entfernt odT ri?!^h T ' 5 "Exonukleasedomane der naturlich auftretenden DNA- 



1. 
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OmTo^ZeToZll:^!' • ^^^"-"Lf ^---^"-t. die Polymerase eine modifizierte 
^7y^DmiolZ^S' ExonukleaseakWvita, der naturlich auftrete den 



nml^r "ach einem der AnsprOche 1 bis 3. dadurch gekennzeichnet. da6 das Gen fur eine 
iZSnl ^'^ P- "^""granim Polymerase 

5. Vertan,en nach jinem aer AnsprtJcte ! bis 4, .tedurOi gekennzelchTOt. daS das Gon modifelert Isl um 

^S*"" S*'"^'""". ^ <22 ™d HIS ,23 „saa od., a«OT. 

« <«• A-mosSu,ob.«ina L,s 1,8 bis His 

iTzri^T^/^~ir^-'' '-^ - ™ - 
" rdr:arr^-- - ,« .d 

r.^lfrH "^"' Anspruche 1 bis 12. bei dem die modifizierte prozessive DNA-Polymerase 

Sequenz.er^ngsreakt.onen verwendet werden. wenigstens 500 Basen lang an DNA oebundenTu 
S;St . ^^'^^"^""9 ^' ONA-Sequenzierung eine ausreichendToNA pXme^SLlJ 

"'"^"^ -usammengebrih, wird derTSL 

rLrrpr^Ls— ^^^^ 

naturlicherweise vorhandenen AminosSure unterscheidet. trsatzstelle 

?mil"if ' ""f'^'^^"^^ "^'^'^ Anspruch 14. dadurch gekennzeichnet. da6 eine oder mehrere der 
Ammosauren der ExonukleasedomSne von dem Aminoende zu dem Aminosaurebau^Ji Sf d^S 

T^TomZ^^T^' en.sprechende?S,ma„e 

Zr II 7P;°NA-Po ymerasen entfemt oder durch eine AmlnosSure ersetzt ist bzw sind die sich von 

der an der Substtuhonstelle natOrlich auftretenden Aminosaure unterscheidet 
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0 



5 



i'rSt h'""'*'*""'*^' "^"^ '^'P^'^' gekennzeichnet. daB die Polymera- 

dri^^;^^or;o~^^^^ T7-Tvp-Po„.erasa wenigstens ao-. d. natCHIch auL^e^ 

17. Gereinigtes modifiziertes Gen nach einem der AnsprUche 14 und 15. dadurch gekennzeichnet daB die 
Po^mer^e erne ExonukleaseaktivitSt aufwelst. die wenigstens 50% kleiner ist als die naWrlfch 
auftretendeExonukleaseaktivitatdernatarllchauftretendenT7-Typ-DNA-PolymeTase 

ia Gereinigtes modifiziertes Gen nach einem der AnsprUche 14 bis 17. dadurch gekennzeichnet dafl das 
n!^Z I TT"^ DNA-Polymerase codiert. die modifiziert ist. urn die /Sivitit der 

zu vXem ^-"^•^--"-'•t^t au, weniger als 500 Einheiten pro Mi'lligramm pSymerase 

SntS? .'"°'^"'"'f^"^ ^i"^"^ AnsprUche 14 bis 18. dadurch gekennzeichnet daB das 

Gen mod,f,z,ert wurde. um d.e naturlich auftretende Exonukleaseaktivitat zu eliminieren. 

Slaifil'^dT^^^^^ Gen nach einem der AnsprUche 14 bis 19. dadurch gekennzeichnet. daB ein 
em!mT '■'-^°""^'«^«^°'^a"« "etOrlich auftretenden DNA-Polymerase ersetzt oder 

21. Gereinigtes modifiziertes Gen nach einem der AnsprUche 14 bis 20. dadurch gekennzeichnet dafl die 
prozessive modifizierte T7-Typ-DNA-Polymerase eine T7-DNA-Polymerase ist. 

a^flSSen'^T^'S^^^^^ ^'''"'^ 9-kennzeichnet. daB His 123 der naturlich 

aunretenden T7-DNA-Polymerase ersetzt oder entfernt ist. 

'^^:i'lr'::^^Z''''' ""'^^^^ ^^^-^^^ gekennzeichnet. daB Ser 122 und His 123 

^ iTnT^rK"""?" ^'^^^^^ "^'^'^ ^^^'"""^ 21. dadurch gekennzeichnet. daB Lys 118 und Arg 119 
der naturlich auftretenden T7-DNA-Polymerase ersetzt oder entfernt sind. ^ 

26. f «'e;"'9tes jTiodifiziertes Gen nach Anspruch 21. dadurch gekennzeichnet. daB Arg 131 Lys 136 Lvs 
140. Lys 144 und Arg 145 der natUrlich auftretenden T7-DNA-Polymerase ersetzt oder entfernt liS. 

" 1 CeS Z:::^!^:^^- ONA.Po,merase. die nach dem Verfahren nach Anspruch 

Revendlcatfons 

^' m^f P[o^^o^on d'une ADN polymerase modifiee et purifi^e. selon lequel on exprime un qfene 
ocJJ!! P"'^^^^^^^ Progressive modiflee ayant une 3^ d'ADN 

associ^e a tout co-facteur n^cessaire H cette activity d'ADN polymerase, celui-ci resultant de la 
mod.f.cat.on d'un gene naturel modifie par remplacement d'un ou plusieurs am^oiddes Ja^s le 

rvlTnaLXn^r 'T'^l"'"'' amlnoaclde aSete ceTu 

Jouvant naturellement au site de substitution, ou par deletion de ceux-ci de facon k r^duire factivite 
d exonucl6ase naturelle 3'-5' de I'ADN polymerase naturelle. 

^ ^.'^ff revendication 1. caracterise en ce que ractivite de polymerase de rADN polymerase 

modifiee. est d'au moins 90 % de celle de I'ADN polymerase naturelle polymerase 

^ ZT^f k"J! 1 O" 2- caracterise en outre en ce que la polymerase est une ADN 

% ^"°PS^9ique de type T7 modifiee. ayant une activite d'exonuc/ase 3^'. au moinfso 
^ infeneure h I'actnrite d'exonuciease naturelle de I'ADN polymerase de type T7 naturelle 
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ADN^l'^lr '^^^ revendications 1 a 3. caractSrise en ce que le gfene code pour une 

moms de 500 unites par mg de polymerase. - . » 

^' S?rr" revendications 1 a 4. caracterise en outre en ce que le gene a ete 

modifie de fagon a suppnmer I'activlt^ d'exonucl^ase naturelle de fADN polymerase naturelle 

7. Proc4d^ seion l-une quelconque des revendications 4 a 6, caracterise en que I'ADN polymerase 
progressive modiMede type T7 est une ADN polymerase T7. i^in polymerase 

8. Precede selon la revendication 7, caracterise en ce que le residu His 123 de I'ADN polymerase T7 
naturelle. est remplace ou supprime. polymerase i / 

^" ou°s1ppVm?s" " '^^"•^^"^^ ^" ''"^ '^^ Ser 122 et His 123 sont remplaces 

72^sont%l^!r"'^''^'^°" ^' ""^'^"^"^ ^" ''"^ '^^'■''"^ d^"^es d'aminoacide Lys 118 S His 

11. Proced6 selon la revendication 7 ou 9. caracterisd en ce que les r^sidus Lys 118 et Arg 119 de I'ADN 
polymerase T7 naturelle. sont remplaces ou supprim^s. «rg i la ae i aun 

ll'ZlVt^ell^Tr^'""'"" '•if^^'^^"^^ ^« ^^^'dus Arg 131. Lys 136, Lys 140. Lys 144 

et Arg 145 de I ADN polymerase T7 naturelle. sont remplaces ou supprimes. 

13. Precede selon Tune quelconque des revendications 1^12. dans lequel I'ADN polymerase progressive 
modifi^e. est capable de rester Ii6e ^ de I'ADN sur au moins 500 bases dans es ZdS s 
normalement employees pour des reactions de sequengage de I'ADN. conditions 

rnnHi^L?' ^^'^ de rester li^e a de I'ADN sur au moins 500 bases dans les 

condjions normalement employees pour des reactions de s^quengage d'ADN. et ayant une aSviJ 

s^lSn"^ ""T""'' '''' s^quengage d'ADN lorsque cette pJ^Sa e 

est a^oc.6e h n'.mporte quel facteur d'h5te n^cessaire k cette activity d'ADN polymerase rTsultant de 

1^4 P"^^'^^^^ "f-e'te- "n ou plusieurs aminoacldes du domaine d'exonuclease compris dans la 
ses deTr^T. r " °" correspondan, d'autres AD^polymla- 

!nhc«L^ etant remplaces par un aminoacide autre que celui naturellement present au site de 
substitution, ou ceux-ci etant supprimes. 

TLZ'^n^f. Purifie selon la revendication 14. caracterise en ce qu'un ou plusieurs des aminoacides 
^4 de ?ADN r""f' amino-temiinale jusqu'au residu derive d'aminoacide 
?i ^7 Z relTS"' ^ °" '""^^'"^ correspondant d'autres ADN polymerases de 

^,Z,7, '^'"P'^^s P3^ aminoacide autre que celui naturellement present dans le site de 
substitution, ou ils sont supprimes. 

de la noSr!!'"'"^ " °" caracterise en ce que I'activite de polymerase 

17. Gene modifie et purifie selon la revendication 14 ou 15. caracterise en ce qu'en outre la polymerase a 
j:rym?r::rnr.r^dr^^"T7^^^^^ " '-'^^-^^ ^ ^-^^ — - nature^e^^^^N^ 
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1& G6ne modifie et purifig selon I'une quelconque des revendications 14 k 17. caracterise en ce que le 
gene code pour una ADN polymerase progressive et modifi^e de fagon a r^duire factivitg d'exonucl^a- 
se naturelle, a moms de 500 unites par mg de polymerase. 

19. 'nodifig et purlfie^ selon I'une quelconque des revendications 14 a 18. caracterise en outre en ce 
naoirelle ^ ^ supprimer I'activlte d'exonucMase naturelle de I'ADN polymerase 

T^f u* ^"""1"^'°" ''""^ quelconque des revendications 14 S 19. caracterise en outre en ce 
supprime d-exonuctease 3-5- de I'ADN polymerase naturelle. est remplace ou 



21. Gene modifie ou purifie selon Tune quelconque des revendications 14 S 20. caracterise en ce que cette 
ADN polymerase progressive modifiee de type T7 est une ADN polymerase de type T7. 

^' SllZf '?7^' t""? " ''^^r^'"^*'"" "^""^^"^^ ^" ''"^ ^^sidu His 123 de I'ADN 
polymerase T7 naturelle est remplace ou supprime. 

2a Gene modifie et purifie selon la revendication 21. caracterise en ce que les residus Ser 122 et His 123 

sont remplaces ou supprim^s. 

26. Gene modifie et purifie selon la revendication 21. caracterise en ce que les residus Arg 131 Lys 136 
Lys 140. Lys 144 et Arg 145 de I'ADN polymerase T7 naturelle. sont remplaces ou supprimes'. 

i.*s;"s:u:nX'c?ADt"" ^^^^^""^ 



26 



EP 0 386 859 B1 




27 



EP 0 386 859 B1 



FIGuRT 2 
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FIGURE 3 

PACYC177 ^ ^S869 
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FIGURE 4 
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FIGURE 6 
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1. Denature ONA 

2. Aftneal primers 

3. ONA synthesis 



a b c 



a o c 



B 



1. Denature ONA 
Z. AnneaJ primers 
3. DNA synthesis 



Previously synthesized strands 
now serve as templates. 



a b c 



a o c* 



1. Denature DNA 

2. Anneal primers 

3. DNA synthesis 



Repeal cyde of denaturaiion. annealing, 
and DNA synthesis 1 6 more Umes. 
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Region between two primers 
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FIGCRE 7 



10 

TTCTTCTCAT 
60 

TCTGGCGTCA 
110 

AATCACTGCA 
160 

GTTTTTTGCG 
210 

TGTTGACAAT 
260 

AACAATTTCA 
310 

GTTACACCAA 
360 

ATATGAGCGA 
410 

GTACTCAAAG 
4 60 

CGG7CCGTGC 



20 

GTTTGACAGC 
70 

GGCAGCCATC 
120 

TAATTCGTGT 
170 

CCGACATCAT 
220 

TAATCATCGG 
270 

CACAGGAAAC 
320 

CAACGAAACC 
370 

TAAAATTATT 
420 

CGGACGGGGC 
470 

AAGATGATCG 



30 

TTA7CATCGA 
80 

GGAAGCTGTG 
130 

CGCTCAAGGC 
180 

AACGGTTCTG 
230 

CTCGTATAAT 
280 

AGGGGATCCG 
330 

AACACGCCAG 
380 

CACCTGACTG 
430 

GATCCTCGTC 
480 

CCCCGATTCT 



40 

CTGCACGGTG 
90 

GTATGGCTGT 
140 

GCACTCCCGT 
190 

GCAAATATTC 
240 

GTGTGGAATT 
290 

TCAACCTTTA 
340 

GCTTATTCCT 
390 

ACGACAGTTT 
440 

GATTTCTGGG 
490 

GGA7GAAATC 



50 

CACCAATGCT 
100 

GCAGGTCGTA 
150 

TCTGGATAAT 
200 

TGAAATGAGC 
250 

GTGAGCGGAT 
300 

GTTGGTTAAT 
350 

GTGGAG7TAT 
400 

7GACACGGA7 
450 

CAGAG7GG7G 
500 

GC7GACGAA7 



33 



EP 0 386 859 B1 



TlGJr^ 7 (continued) 



510 

ATCAGGGCPJ^ 
560 

ACTGCGCCGA 
610 

AAACGG7GAA 
660 

TGAAAGAGTT 
710 

TGCCCCGTCG 
7 60 

GTTGACGGAT 
810 

CAACAACGAA 
860 

CGATAAAATT 
910 

AAGCGGACGG 
960 

TGCAAGATGA 
1010 
CAAACTGACC 
1060 
CGAAATATGG 
1110 
GAAGTGGCGG 
1160 
GTTCCTCGAC 
1210 
TCGCTAAAAA 
1260 
GATCCCCCTG 
1310 
ATGCAGCTCC 
1360 
CAGACAAGCC 
1410 
CAGCCATGAC 
1460 
ATGCGGCATC 
1510 
ATACCGCACA 
1560 
TTCCTCGCTC 
1610 
TATCAGCTCA 
1660 



520 

ACTGACCGT7 



570 

AATATGGCAT 
620 

GTGGCGGCAA 
670 

CCTCGACGCT 
720 

CTAAAAACTG 
770 

CCCCGGGGA7 
820 

ACCAACACGC 
870 

ATTCACCTGA 
920 

GGCGATCCTC 
970 

TCGCCCCGAT 
1020 
GTTGCAAAAC 
1070 
CATCCG7GGT 
1120 
CAACCAAAGT 
1170 
GCTAACCTGG 
1220 
CTGGACGCCC 
1270 
CCTCGCGCGT 
1320 
CGGAGACGGT 
1370 
CGTCAGGGCG 
1420 
CCAG7CACGT 
1470 
AGAGCAGA77 
1520 
GA7GCG7AAG 
1570 
AC7GAC7CGC 



1710 
7AAAAAGGCC 
1760 

1810 



1620 

C7CAAAGGCG 
1670 
rt'orunurt* G . G 
1720 
GCG77GC7GG 
1770 
AAA7CGACGC 
1820 
ACCAGGCG77 



530 

GCAAAAC7GA 
550 

CCG7GG7A7C 
630 

CCAAAG7GGG 
680 

AACC7GGCG7 
730 

GACGCCCGGC 
780 

CCG7CAACC7 
830 

CAGGC77A77 
860 

C7GACGACAG 
930 

G7CGA777C7 
980 

- 7C7GGA7GAA 
1030 
7GAACA7CGA 
lOSO 
A7CCCGAC7C 
1130 
GGG7GCAC7G 
1130 
CG7AAGGGAA 
1230 
GGCG7GAG7C 
1290 
77CGG7GA7G 
1330 
CACAGC77G7 
13S0 
CG7CAGCGGG 
1430 
AGCGA7AGCG 
1460 
G7AC7GAGAG 
1530 
GAGAAAA7AC 
15S0 
7GCGC7CGG7 
1630 
G7AA7ACGG7 
1650 
AGCAAAAGGC 
1730 
CG77777CCA 
1730 
. wAAG7CAGA 
1830 
7CCCCC7GGA 



540 

ACA7CGA7CA 
590 

CCGAC7C7GC 
640 

7GCAC7G7C7 
690 

AAGGGAA777 
740 

G7GAG7CA7G 
790 

77AG7TGG77 
840 

CC7G7GGAGT 
890 

77T7GACACG 
940 

GGGCAGAG7G 
990 

A7CGC7GACG 
1040 
7CAAAACCCT 
1090 
7GC7GC7G77 
1140 
7C7AAAGG7C 
1190 
777CA7G77C 
. 12« 
A7GC7AAC77 
1290 
ACGG7GAAAA 
1340 
C7G7AAGCGG 
1390 
7G77GGCGGG 
1440 
GAG7G7A7AC 
1490 
7GCACCA7A7 
1540 
CGCA7CAGGC 
1590 
CG77CGGC7G 
1640 
7A7CCACAGA 
1690 
CAGCAAAAGG 
1740 
7AGGC7CCGC 
1790 
GG7GGCGAAA 
1840 
AGC7CCC7CG 



550 

AAACCC7GG7 
600 

7GC7G77CAA 
650 

AAAGGTCAG7 
700 

CA7G77CGGG 
750 

C7AAC7TAG7 
800 

AA7G77ACAC 
850 

TATA7ATGAG 
900 

GATGTACTCA 
950 

G7GCGGTCCG 
1000 
AA7ATCAGGG 
1050 
GG7AC7GCGC 
1100 
CAAAAACGG7 
1150 
AG77GAAAGA 
1200 
GGG7GCCCCG 
1250 
AG7G77GACG 
1300 
CC7C7GACAC 
,1350 
A7GCCGGGAG 
1400 
7G7CGGGGCG 
1450 
7GGC7TAAC7 
1500 
GCGG7G7GAA 
1550 
GC7C77CCGC 
1600 

1650 
H i ^ ^ ^ * — « 



1700 
CCAGGAACCG 
1750 
CCCCC7GACG 
1800 
CCCGACAGGA 
1850 
7GCGC7C7CC 
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FIGLru:. 7 (continued) 



1860 
TGTTCCGACC 
1910 
GAAGCGTGGC 
1960 
TAGGTCGTTC 
2010 
CGACCGCTGC 
2060 
GACACGACTT 
2110 
GCGAGGTATG 
2160 
CGGCTACACT 
2210 
TTACCTTCGG 
2260 
GCTGGTAGCG 
2310 
AAAAGGATCT 
2360 
AGTGGAACGA 
2410 
AGGATCTTCA 
2460 
CTAAAGTATA 
2510 
GTGAGGCACC 
2560 
TGAC7CCCCG 
2610 
CCCCAG7GCT 
2660 
TATCAGCAAT 
2710 
GCAACTTTAT 
2760 
AGTAAG7AGT 
2810 
CAGGCATCGT 
• 2860 
GGTTCCCAAC 
2910 
AGCGG7TAGC 
2960 
CAG7G77A7C 
3010 
A7GCCA7CCG 
3060 
A77C7GAGAA 
3110 
CACGGGA7AA 
3160 
GG AAAACG 77 



1870 
C7GCCGC77A 
1920 
GC7T7C7CAA 
1970 
GC7CCAAGC7 
2020 
GCC77A7CCG 
2070 
ATCGCCAC7G 
2120 
7AGGCGG7GC 
2170 
AGAAGGACAG 
2220 
AAAAAGAG77 
2270 
G7GG777777 
2320 
CAAGAAGA7C 
2370 
AAAC7CACGT 
2420 
CC7AGATCC7 
2470 
7A7GAG7AAA 
2520 
7ATC7CAGCG 
2570 
7CG7G7AGAT 
2620 
GCAA7GA7AC 
2670 
AAACCAGCCA 
2720 
CCGCC7CCA7 
2770 
TCGCCAG77A 
2820 
GG7G7CACGC 
2870 
GA7CAAGGCG 
2920 
TCC77CGG7C 
2970 
AC7CA7GG7T 
3020 
TAAGATGC77 
3070 
7AG7G7A7GC 
3120 
ii^CCGCGCCA 
3170 



iseo 

CCGGA7ACC7 
1950 
7GC7CACGC7 
1980 
GGGC7G7G7G 
2030 
G7AAC7A7CG 
2080 
GCAGCAGCCA 
2130 
7ACAGAG77C 
2180 
7A7T7GG7A7 
2230 
GG7AGC7C77 
2280 
7G7TTGCAAG 
2330 
C7T7GATC77 
2380 
7AAGGGA777 
2430 
7T7AAA77AA 
2480 
C7TGG7C7GA 
2530 
A7C7G7C7A7 
2580 
AAC7ACGA7A 
2630 
CGCGAGACCC 
2680 
GCCGGAAGGG 
2730 
CCAGTC7A77 
2780 
A7AG777GCG 
2830 
7CG7CG777G 
2880 
AG77ACA7GA 
2930 
C7CCGA7CG7 
2960 
ATGGCAGCAC 
3030 
77C7G7GAC7 
3080 
GGCGACCGAG 
3130 
CA7AGCAGAA 
3180 
AAAAC7C7CA 



1890 
G7CCGCC777 
1940 
G7AGG7A7C7 
1990 
CACGAACCCC 
2040 
7C7TGAG7CC 
2090 
CTGG7AACAG 
2140 
77GAAG7GG7 
2190 
CTGCGCTC7G 
2240 
GA7CCGGCAA 
2290 
CAGCAGA77A 
2340 
77C7ACGGGG 
2390 
7GG7CA7GAG 
2440 
AAA7GAAG77 
2490 
CAG77ACCAA 
2540 
77CG77CA7C 
. 2590 
CGGGAGGGC7 
2640 
ACGC7CACCG 
2690 
CCGAGCGCAG 
2740 
AAT7G77GCC 
2790 
CAACG77G77 
2840 
G7A7GGC7TC 
2890 
7CCCCCA7G7 
2940 
7G7CAGAAG7 
2990 
TGCA7AA77C 
3040 
GG7GAG7AC7 
3090 
77GC7C77GC 
3140 
C77TAAAAG7 
3190 



1900 
C7CCC77CGG 
1950 
CAG77CGG7G 
2000 
CCG77CAGCC 
2050 
AACCCGGTAA 
2100 
GA77AGCAGA 
2150 
GGCC7AAC7A 
2200 
C7GAAGCCAG 
2250 
ACAAACCACC 
2300 
CGCGCAGAAA 
2350 
7C7GACGC7C 
2400 
A77A7CAAAA 
2450 
77AAATCAA7 
2500 
7GC77AA7CA 
2550 
CA7AG77GCC 
2600 
7ACCA7C7GG 
2650 
GC7CCAGA7T 
-2700 
AAG7GGTCC7 
2750 
GGGAAGC7AG 
2800 
GCCA77GC7G 



AGGA7C77AC 



2850 
A77CAGC7CC 
2900 
TG7GCAAAAA 
2950 
AAG77GGCCG 
3000 
7C7rAC7G7C 
3050 
CAACCAAG7C 
3100 
CCGGCG7CAA 
3150 
GC7CA7CA77 
3200 
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FIGURE 7 (continued) 



3210 
ATCCAGTTCG 

3260 
TTACTTTCAC 

3310 
GCAAAAAAGG 

3360 
CCTTTTTCAA 

3410 
GATACATATT 

34 60 
ACATTTCCCC 

3510 
GACATTAACC 



3220 
ATGTAACCCA 

3270 
CAGCG7TTCT 

3320 
GAATAAGGGC 

3370 
TATTATTGAA 

3420 
TGAATGTATT 

3470 
GAAAAGTGCC 

3520 
TATAAAAATA 



3230 
CTCG7GCACC 

3280 
GGGTGAGCAA 

3330 
GACACGGAAA 

3380 
GCATTTATCA 

3430 
TAGAAAAATA 

3480 
ACCTGACGTC 

3530 
GGCGTATCAC 



3240 
CAACTGATCT 

3290 
AAACAGGAAG 

3340 
TGTTGAATAC 

3390 
GGGTTATTGT 

3440 
AACAAATAGG 

3490 
TAAGAAACCA 

3540 
GAGGCCCTTT 



3250 
TCAGCATCTT 

3300 
GCAAAATGCC 

3350 
TCATACTCTT 

3400 
CTCATGAGCG 

3450 
GGTTCCGCGC 

3500 
TTAXrATCAT 

3550 
CGTCTTCAAG 



AA 
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r"IGuKE 8 



10 20 30 40 50 

GTTGACACAT ATGAGTCTTG TGATGTAC7G GCTGATTTCT ACGACCAG7T 
fiO 70 80 90 100 

CGC7GACCAG TTGCACGAGT CTCAATTGGA CAAAATGCCA GCACT"-CCGG 

120 130 140 150 

CTAAAGGTAA CTTGAACCTC CGTGACATCT TAGAGTCGGA CTTCGCGTTC 

160 170 180 190 200 

GCG7AACGCC AAATCAATAC GACTCACTAT AGAGGGACAA ACTCAAGGTC 

220 230 240 250 

AT7CGCAAGA GTGGCCT7TA TGAT7GACC7 TC7TCCGG7T AA7ACGAC7C 

260 270 280 290 300 

ACTA7AGGAG AACC7TAAGG T77AACTT7A AGACCC7TAA G7G7'^AA7-^A 

310 320 330 340 350 

GAGA777AAA T7AAAGAA7T AC7AAGAGAG GAC7TTAAG7 A7GCG7AAC7 

360 370 380 390 400 

7CGAAAAGAT GACCAAACGT TCTAACCG7A ATGC7CG7GA C77CGAGGCA 

410 420 430 440 450 

ACCAAAGG7C GCAAG77GAA TAAGACTAAG CGTGACCGCT C7CACAAGCG 

460 470 480 490 500 

7AGC7GGGAG GG7CAG7AAG A7GGGACG7T 7A7ATAG7GG TAA7C7GGCA 

510 520 530 540 550 

CCGGATCCGG TA7GAAGAGA 77G77AAG7C ACGATAATCA ATAGGAGAAA 

560 570 580 590 600 

TCAA7A7GA7 CG777C7GAC ATCGAAGC7A ACGCCCTC7T AGAGAGCG7C- 
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riGuRE 8 (continued) 

— .^cr.c^ ,,„,,,-o ^^^^^^ 
MccT^cccT ccc.CTc.cr TcccxcccT^ xc.c.rcl^ cccW?? 

.CCXTCC.C3 .CCCCCXCXX .TXCTCTXCC .C^CCG^^^ c;UCT.X^|? 



GXXCCXGCAX XGACCAA^CX GGCAAAGT~G CAAXXGaI?? r.., 

810 820 ^^^TG^CC GAGAGXXCCA 

830 840 

rcxA XXGACACCCX XGXGXXGXCA 
8/0 880 890 

•ACC GAXAXGGGXC xxcxgcgxxc 

CCCC=«MC =CT„=.«= TCCOCT^ C«CC=.^^^ .XXXTCC^I? 



810 820 Q^n ^H^,AGTTCCA 



CC=.„T.C= .T.„CgC= TTCTOO^cI^ «XCCC^J|° =,cC=Ti^- 

CCC=TXX«C «x«.cc^ xcc«cS .x.„x^^J^ xx«cxS 
occccxcxc. .xx=cxcc=x «.xx4 ^coT^SS „c=x=ii?? 



c«ccx«.c cx=.c.cx« »xcxx^ c.xc==i5?? c«=x;JL?? 

«x,ccx«j X.CCCXCJCJ xx«c.^ x«..xi^l? .cxxxci^S 

xa«.ccx« =„c«.o„ c«c=. ^ ccccxcl^'? xx=c.;^^?? 

««ccc=cc .=x.c=xx== xcoxoc ^ x.„cci^2 xx=«c^!^? 

.cxcxxx^c ccxxccxcxc =x.«cic xc.c«JII^ cxccx.^5»° 

cx«=x«=x ccc=«=„c x««c I «==x=i|^? x=xo=x^5L? 

«Tc«cx.c xc=«=o«x .c=x=x o.cccxi I? «c«cil?? 

XAXCGACC~r «s»o,Z^ ■'■'-^ 1740 ii^n 

irti<_ij«(.i..C rtiirtAftGAGT ACXXGA7GAX "CAr"! 

1760 1770 i-^pn * " AXCGGACAGX 

CXGCTGAGGG AGACAAAGCA XGGCXXCg'- A~G— -r^ r--~^ 

1810 1320 — GGA7GGTAAG . 

AXXCAXGGXX CTCX^AACCC ^r-^--, 

1860 ^^87^ =~~-CGGGXC GXGCGACCCA 

TGCGXXCCCA AACCXXGCGC AJJ^X'CC-- ^^00 

1910 1O20 lo^X - = ^'^C=^~CX CCTXAXGGAG 

AGCAGXGrCG CGCXGC--— Gac—r-ll^ ^'"'^ 1550 

Gv,v,«v..G«GC ACCAXrXGGA XGGGAXAACT 
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FIC^JBE 3 (continued) 



1960 
GGTAAGCCTT 
2010 
CTGCTTGGCT 
2060 
AGA7TCTTAA 
2110 
CCTACCCGAG 
2160 
TGG7GATGAG 
2210 
AGGAACTCAA 
2260 
CGCGAGTCTA 
2310 
TGAGCAACAA 
2360 
GTAAGGTACA 
2410 
TCTGCTGGTG 
2460 
GCTCG7AGAG 
2510 
TGGCATGGGT 
2560 
GCTCAGGTGG 
2610 
CCACTGGAAC 
2660 
ATTGGGCGAT 
2710 
TTAACAGG7G 
2760 
7GGG7ACAC7 
2810 
77G7A7G7AA 
2860 
ACGG77CAAA 
2910 
CGG7AGG7CC 
2960 
77GACGAAGA 
3010 
7CCA7G7G7G 
3060 
A77A7GGC7A 



1970 
GGG77CAGGC 
2020 
CAC 1 7CA7GG 
2070 
CGGCGACA7C 
2120 
A7AACGC7AA 
2170 
AAGA77GGAC 
2220 
GAAGAAA77C 
2270 
7CCAACAGAC 
2320 
G7CAAG7GGA 
2370 
CG77CG7AG7 
2420 
C7C7CA7C7G 
2470 
AAAGGC77GA 
2520 



ACA7GA7GAA 
2570 
7CA77GAGAC 
2620 
77CCGG7G7C 
2670 
77GCCAC7GA 
2720 
C7GC77CTGA 
2770 
G7C7ATTACC 
2820 
GGA7GG7AAA 
2870 
CCAACACAGG 
2920 
GAA7A7AAGG 
2970 
7G7GC77A77 
3020 



1960 
7GGCA7CGAC 
2030 
C7CGC777GA 
2080 
CACAC7AAGA 
2130 
GACG77CA7C 
21S0 
AGA77G77GG 
2230 
C77GAGAACA 
2280 
AC77G7CGAG 
2330 
AACGCCGC7G 
23S0 
CC7CACGC7G 
2430 
CAAAC7G7GG 
2480 
AGCA7GGC7G 
2530 
A7CCAAG7AG 
2580 
CGCACAAGAA 
2630 
77C7GGA7AC 
2680 
7ACAGGAGGC 
2730 
AA7GC7AG7A 
2780 
C7A7GC7GAC 
2830 
777AG7AAGG 
• 2880 



1990 
GCA7CCGG7C 
2040 



7GA 7AACGGCGAG 



AAGAG 
3070 



AGA7GCCAAG 

2920 
A7GGAGAC77 

29S0 
77CACA7GGG 

3030 
AAACAAAGGC 

3080 



2090 
ACCAGA7AGC 
2140 
7A7GGG77CC 
2190 
7GC7GG7AAA 
2240 
CCCCCGCGA7 
2290 
7CC7C7CAAT 
2340 
GA77AAAGG7 
2390 
CC77GAA7AC 
2440 
A77A7CAAGA 
2490 
GGA7GGGGAC 
2540 
GC7GCCG7AC 
2590 
GCGA7GCGC7 
2640 
CGAAGG7AAG 
. 2690 
7AC7CA7GAA 
2740 
GCC7ACAAA7 
2790 
7CAGAG7AAA 
2840 
77CAGG77AA 
2890 
CAGG77AGGC 
2940 
7GACA77C77 
2990 
ACGAAG7AAA 
3040 
A7AAAAC7A7 



2CCC 
77GAGC7ACG 
2050 
7ACGC7CACG 
2100 
7GC7GAAC7A 
2150 
7C7A7GG7GC 
2200 
GAGCGCGG7A 
2250 
7GCAGCAC7C 
2300 
GGG7AGC7GG 
2350 
C7GGA7GG7C 
2400 
CC7AC7GCAA 
2450 
CCGAAGAGA7 
2500 
777GCG7ACA 
2550 
CGAAGAGA77 
2600 



GGG77GGAGA 
2650 
A7GGG7CC7A 
2700 
CGA-AAGACAC 
2750 
A * tCCAAAGC 
-. 2800 
GAGGAC77GG 
2850 
AACAGCCACA 
2900 
7AGG7GGA7G 
2950 
GCGG77G7GG 
3000 
AGG7AAGACA 
3050 
AGGAGAAA77 
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A.nTGCTACTA 



60 

AAATGAAAAT 
110 

ATGGTCAAAC 
160 

TGGAATGAAA 
210 

TGAGCTACAG 
2 60 

TGACCTCTTA 
310 

TTGGAGTTTG 
360 

ATATTTGAAG 
410 

. TTGCTTCTGA 
460 

TCATTC7CGT 
510 

TATTTATGAC 
560 

CTATTACCCC 
610 

GGTTTTTATC 
660 

TATGCCTCGT 
710 

GTATTCCTAA 
7 60 

CCGTTAGTTC 
810 

GTATAATGAG 
' 860 



10 20 
CTATTAGTAG 



70 

ATAGCTAAAC 
120 

TAAATCTACT 
170 

CTTCCAGACA 
220 

CACCAGATTC 
270 

TCAAAAGGAG 
320 

CTTCCGGTCT 
370 

TCTTTCGGGC 
420 

CTATAATAGT 
470 

TTTCTGAACT 
520 

GATTCCGCAG 
570 

CTCTGGCAAA 
620 

GTCG7CTGGT 
670 

AATTCCTTTT 
720 

ATCTCAACTG 
770 

GTTTTATTAA 
820 

CCAGTTCTTA 
870 



30 

AATTGATGCC 
80 

AGGTTATTGA 
130 

CGTTCGCAGA 
180 

CCGTACTTTA 
230 

AGCAATTAAG 
280 

CAATTAAAGG 
330 

GGTTCGCTTT 
380 

TTCCTCTTAA 
430 

CAGGGTAAAG 
480 

GTTTAAAGCA 
530 

TATTGGACGC 
580 

ACTTCTTTTG 
630 

AAACGAGGGT 
680 

GGCGTTATGT 
730 

ATGAATCTTT 
780 

CGTAGATTTT 
830 

AAATCGCATA 
880 



40 

ACCTTTTCAG 
90 

CCATTTGCGA 
140 

ATTGGGAATC 
190 

GTTGCATATT 
240 

CTCTAAGCCA 
290 

TACTCTCTAA 
340 

GAAGCTCGAA 
390 

TCTTTTTGAT 
440 

ACCTGATTTT 
490 

TTTGAGGGGG 
540 

TATCCAGTCT 
590 

CAAAAGCCTC 
640 

TATGATAGTG 
690 

ATCTGCATTA 
740 

CTACCTGTAA 
790 

TCTTCCCAAC 
840 

AGGTAATTCA 
890 



50 

CTCGCGCCCC 
100 

AATGTA7CTA 
ISO 

AACTGTTACA 
200 

TAAAACATGT 
250 

TCCGCAAAAA 
300 

TCCTGACCTG 
350 

TTAAAACGCG 
400 

GCAATCCGCT 
450 

TGATTTATGG 
500 

ATTCAATGAA 
550 

AAACATTTTA 
600 

TCGCTAT7TT 
650 

TTGCTCTTAC 
700 

GTTGAATGiq 
750 

TAATGTTG7T 
800 

G7CC7GAC7G 
850 

CAA7GA77AA 
900 
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FIGLTE 9 (ccntir.'jec) 

AGTTGAAATT AAACCATCTC AAGCCCAJ^.T? ~AC-AC 

CTC«CAG=G CAASCCT7AT TCACTG^rc «CA=C7~? TTACGTTOa? 
XT.C„AA.= AAXA.Cca=. .c„=.|^ A„AC.c| A.AACa^g 

TXC=.CA=3 ^-Hi -.=cil?£ 
AACTAACATG GAGCAG57CG CCGATTTCGA CACAATTTW CAGGCgItgA 
TACAAAICTC CGIIGIACTT IGITTCOCK TTGCTAlil? CGCTGGGGg? 

caaagaxgag xgxxxxagxg XAXxGx|lg c=xcxx|^? „agg^| 

XGCCXXCGXA GXGGCAXXAC GXAXXXXACC CGXXXA^ G AAACTX 

AIGAAAAAGX CTITAGXCCT CAAAGCCXCT GTAGCCGXTC CXACCCXc" 

1360 1370 1360 lion i^nn 

TCCGATGCTG TCTTTCGCTG CTGAGGGTGA CGATCCCGCA AAAGCGGC?? 

TTAJ.CTCCCT GCAAGCCTCA GCGACCgH? ATATCGGTTA TGCGTGGGcS 
1460 1470 1480 1490 i Qnn 

ATGGTTGTTG TCATTGTCGG CGCAACTATC GGTATCAAG? TCTTTA^a^i 

^^-^^ ^^20 1530 1540 iccn 

ATTCACCTCG AAAGCAAGCT GATAAACCGA TACAATTAAA GGCTCCT"'- 
'^^^^ ^5*^0 1550 1590 ^'kr^n 

GGAGCCTTTT TTTTTGGAGA TTTTCAACGT GAAAAAATTA TTAT-CGC^i 
^^^^ 1620 1630 li;40 ^(^^r\ 

TTCCTTTAGT TGTTCCTTrC TATTCTCACT CCGCTGA^AC TGirGA^lG? 

TG-.T.-,Jf!° • 1650 1700 

?7?? AACCCCATAC AGAAAATTCA TTTACTAACG 7CTGGAAAGA 
1710 1720 . 1730 1740 '-,-^0 

CGACAAAACT TTAGATCGTT ACGCTAACTA TGAGGG77GT CTGTGGAA-G 
I'oO 1770 17S0 1790 1 o^^'^ 

CTACAGGCGT TGTAGTTTGT ACTGGTGACG AAACTCAGTG TTACGGTaS 

1820 1830 1S40 Tccrt 

'''''''''^illi ^-^ccc^^^ aatgagggtg ctgcctct^a 

1870 1880 ificn i 

GGGTGGCGGX TCTGAGGGXG ====XXCTGA GGGXGGC«? ACXAAACC?" 

12z0 1^30* iQ^ri 

CTGAGTACGG TGATACACCT ATTCCGGaCT ATACT™? CAACCCTC"? 

1560 1970 1S80 iqcq onn^ 

GACGGCACXT AXCCGCCXGG XACXGAGCAA AACCCcfeJ ;^CCrJ^ 

XXCXCxl?AG GAGXCrC«J crCT.A??!? ...CAXG?i? CAG-^??^? 

2"!) 269; 

GGT7CCGAAA TAGGCAGGGG GCATTAACTC; TTTAXACGGG CAC75t?«- 

2120 2130 2"'40 

CAAGGCACTG ACCCCGTTA^ AACTTATTAC CAGTACAaTC CTGrATCATC 
2I0O 2170 2160 2190 220'^ 

AAAAGCCATG TATGACGCTT ACTGGAACGG TAAATTCAGA GACTGCGCTT 
22-0 22^0 2230 22<0 22=0 
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rlQJHE 9 (continued) 



TCCATTCTGG 
2260 
TCGTCTGACC 
2310 
TGGTTCTGGT 
2360 
AGGGTGGCGG 
2410 
GATTTTGA7T 
24 60 
AAATGCCGAT 
2510 
CTGTCGCTAC 
2560 
TCCGGCCTTG 
2610 
TTCCCAAATG 
2660 
AT7TCCGTCA 
2710 
TTTG7CTTTA 
2760 
AATAAACTTA 
2810 
TTATGTATGT 
2860 
TAATCATGCC 
2910 
TTCCTTCTGG 
2960 
CTTCGGTAAG 



3010 
GGCTTAACTC 

3060 
CCC7CTGACT 

3110 
TCCCTGTTTT 

3160 
ACGTTAAACA 

3210 
TGTTTATTTT 

3260 
TTGGTAAGAT 

3310 
CTTGATTTAA 

3360 
GCCTCGCGTT 

3410 

34 60 
GTTCTCGATG 

3510 
GGAAAGACAG 

3560 



2270 
TGCCTCAACC 
2320 
GGCGGCTCTG 
2370 
CTCTGAGGGA 
2420 
ATGAAAAGAT 
2470 
GAAAACGCGC 
2520 
TGATTACGGT 
2570 
CTAATGGTAA 
2620 
GCTCAAGTCG 
2670 
ATATTTACCT 
2720 
GCGCTGGTAA 
2770 
TTCCGTGGTG 
2820 
AT7TTCTACG 
2870 
AG77C7777G 
2920 
7AAC777G77 
2970 
A7AGC7A77G 
3020 
AA77C77G7G 
3070 
T7G77CAGGG 
3120 
7A7G77A77C 
3170 
AAAAA7CG77 
3220 



GA7CCA77Co 
22S0 
TCC7G7CAA7 
2330 
AGGG7GG7GG 
2380 
GGCGG77CCG 
2430 
GGCAAACGC7 
2480 
TACAG7C7GA 
2530 
GC7GC7A7CG 
2560 
TGG7GC7AC7 
2630 
G7GACGG7GA 
2680 
7CCC7CCC7C 
2730 
ACCA7A7GAA 
2780 
7C777GCG77 
2830 
777GC7AACA 
2880 
GG7A77CCG7 
2930 
CGGC7A7C7G 
2980 



77TG7G;iJ.7A 7CAAGGCC;^-\ 



G7AAC7GGCA 
3270 

TCAGGA7AAA 
3320 

GGC77CAAAA 
3370 

3420 

3470 

352C 

3570 



C7A777CA77 
3030 
GG77A7C7C7 
- 3080 
TG77CAG77A 
3130 
TC7C7G7AAA 
3180 
TC77A777GG 
3230 
AA77AGGC7C 
3260 
A77G7AGC73 
3330 
CC7CCCGCAA 
3350 
CGGA7AAGCC 
3430 

3480 
77GG777AA7 

— w u 

r-.77GG777C7 
3560 



2290 
GC7GGCGGCG 
2340 
CTC7GAGGG7 
2390 
GTGG7GGC7C 
2440 
AA7AAGGGGG 
2490 
CGC7AAAGGC 
2540 
A7GG777CA7 
2590 
GG7GA77T7G 
2640 
7AAT7CACC7 
2690 
AA7CGG77GA 
2740 
TT77C7A77G 
2790 
7C7777A7A7 
2840 
7AC7GCG7AA 
2890 
7A77A77GCG 
2940 
C77AC777TC 
• 2990 
G777C77GC7 
3040 
C7GA7A77AG 
3090 
AT7C7CCCG7 
3140 
GGC7GC7A7X 
3190 
A77GGGA7AA 
3240 
7GGAAAGACG 
3290 
GG7GCAAAA7 
3340 
G7CGGGAGG7 
3390 
T7C7A7A7C7 
3440 
AAAA7AAAAA 
3490 
ACCCG77C77 
3540 
ACA7GC7CG7 
3590 



2300 
GC7C7GG7GG 
2350 
GGCGG77C7G 
2400 
7GG77CCGG7 
2450 
C7A7GACCGA 
2500 
AAAC7XGA77 
2550 
7GG7GACG77 
2600 
C7GGC7C72A 
2650 
T7AA7GAA7A 
2700 
A7G7CGCCC7 
2750 
A77G7GACAA 
2800 
G77GCCACC7 
2850 
7AAGGAG7C7 
2900 
777CC7CGG7 
2950 
77AAAAAGGG 
3000 
C77A77A77G 
3050 
CGC7CAA77A 
*3100 
C7AA7GCGC7 
3150 
77CA77777G 
3200 
A7AA7A7GGC 
3250 



C7CG77AGCG 
3300 
AGCAAC7AA7 
3350 
TCGC7AAAAC 
3400 
GA777GC77G 
3450 
CGGC77GC77 
3500 
GGAA7GA7AA 
3550 
AAA77AG7A7 
3600 
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FIGLni 9 (oontinued) 
GGGATATTAT TTTTCTTGTT CAGGAC7TAT CTATrGTTGA TAAACAGGCG 

CGTTCTGCAT TAGC7GAACA TGTTGX^TA? TGTCGTCGT? TGGACAg"? 

-^^^^ ^^^0 3650 56^0 -^-rnA 

TACTTTACCT TTTGTCGGTA CTTTATAXtC TCTTATTACT GGCTCgIA^ 

^'^^ ^^20 3730 3740 Wen 

TGCCTC7GCC TAAATTACAT GTTGGCGTTG TTAAATATGG CGATTCTcl? 

TTAAGCCCTA CTGTTGAGCG TTGGCTTTA? ACTGGtUgS ATTTGtI?^ 

^^^^ 3820 3830 3840 -loVX 

CGCATATGAT ACTAAACAGG CTTTTTCTAG TAATTATGAT TCCGGTCT^^ 

3860 3870 3880 3890 -^onn 

ATTCTTAT7T AACGCCTTAT TTATCACACG GTCGG7ATTT CAAACCaI?a 

3910 3920 3930 3940 -ialn 

AA777AGG7C AGAAGA7GAA AT7AAC7AAA A7A7A7T7GA AAAAG7777C 

7CGCG7TC77 TG7C77GCGA 77GGA777G? ATCAGcl?'? ACATATAG?? 

^-^^ ^020 4030 4040 Ancn 

""^""^^JoSO ""^^^^^O?^ GAGG77AAAA AGG7AG7C7? TCAGACcSl? 

4070 4080 40Q0 A^r^n 

GA7777GA7A AA77CAC7A7 7GAC7C77C7 CAGCG7C?Ia A7C7AAGC?a 

TCGC7A7G77 T7CAAGGA77 C7AAGGG^ A77AA77^? AGCGACGaS 

7ACAGAAGCA AGG77A77CA C7CACATATA TTGA777i?G TAC7G777?? 

A77AAAAAAG GTAA7TOAA 7GAAA7^G7? AAA7G7li7? AA7777G7l~ 

.....c^ .cx.cxi|?2 crc^ ..^^ 

^'^^^ 4320 4330 4*^40 ^irjcn 

«TTC==CTC TGC=C=ATTT TST«CTT=0 TATTCA^^o? 

="'""5= ATGIM^^^S lACIGTT^c? GTAT«T«? 
CTGAC=TT« ACTTGA^ll? CTACCcJI^ TCTTTATTT? TCTTTTA«? 

TAATCCAAAC AATCiGGTAT AIAITGaIga ATTGC«"a TCI=AT«IC 

— £ "Hi =Hi —1 ~ 

""'"jtlo TTTTAAAATt AATAACGTTC GGGCAA^^J? 

4560 4670 4650 46cn ^?nn^ 

TTTAA.ACGA Gr.G.CGAAT rGTT.GTAAA GTCTAA^^i? .CTAAA^ c? 

4720 4730 4740 47^n 

"""^""Vnl "^^'""^I^^S ^""^^"^IS TA7TAG-7G7 7AG7GCACC? 

47/0 4780 47^0 ^roo 

AA^GA7A777 7AGA7AACCT TCC7CAA77C C777C7AC7G 77nXTT-.?r- 



*810 4820 4835 4840 '"""^^4850 

^"^'^Js^^ A777GAGG7? CAGCAAGG?? 

^j870 4880 4890 4900 

..GC. .G. ^-T77CA777 GC7GC7GGC7 C7CAGCG7GG CAC7G77GCA 

^--^ 4920 45^0 4S40 4950 
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FIGUPE 9 (continued) 



GGCGGTG7TA ATACTGACC3 



4960 
TTCGTTCGGT 
5010 
TAAAGACTAA 
5060 
CTTTCAGGTC 
5110 
TACTGGTCGT 
5160 
CGATTGAGCG 
5210 
ATGGCTGGCG 
52 60 
GAGTTCTTCT 
5310 
CTACAACGGT 
5360 
ACTGATTATA 
5410 
AATCCCTTTA 
5460 
AAAGCACGTT 
5510 
CGGCGCATTA 
5560 
CACTTGCCAG 
5610 
CTCGCCACGT 
5660 
TTTAGGGTTC 
5710 
ATTTGGGTGA 
5760 



4970 
ATTTTTAATG 
5020 
TAGCCATTCA 
5070 
AGAAGGG7TC 
5120 
GTGACTGGTG 
5170 
TCAAAATGTA 
5220 
GTAATATTGT 
5270 
ACTCAGGCAA 
5320 
TAATTTGCGT 
5370 
AAAACACTTC 
5420 
ATCGGCCTCC 
5470 
ATACGTGCTC 
5520 
AGCGCGGCGG 
5570 
CGCCCTAGCG 
5620 
TCGCCGGCTT 
5670 
CGATTTAGTG 
5720 
TGGTTCACGT 
5770 



CGCCCTTTGA CGTTGGAGTC 
5810 5820 



CCTCACCTCT 
4980 
GCGATG7TTT 
5030 
AAAATATTGT 
5080 
TATCTCTGTT 
5130 
AATCTGCCAA 
5130 
GGTATTTCCA 
5230 
TCTGGATATT 
5280 
GTGATGTTAT 
5330 
GATGGACAGA 
5380 
TCAAGATTCT 
5430 
TGTTTAGCrC 
5480 
GTCAAAGCAA 
5530 
GTGTGG7GGT 
5580 
CCCGC7CC7T 
5630 
7CCCCG7CAA 
5660 
C777ACGGCA 
5730 
AG7GGGCCA7 
■ 5780 
CACGT7C777 
5830 



AAC7GGAACA ACAC7CAACC C7A7CTCGGG 



5860 
GGA7777GCC 
5910 
GGCAAACCAG 
5960 
AAGGGCAA7C 
6010 
GGCGCCCAA7 
6060 
7CCAGC7GGC 
6110 
CG C AA77AA7 
6160 
TTATGCTTCC 
6210 
CACACAGG AA 
6260 



5870 
GA777CGGAA 
5920 
CG7GGACCGC 
5970 
AGC7G77GCC 
6020 
ACGCAAACCG 
6070 
ACGACAGG7r 
6120 
G7GAG77ACC 
6170 
GGC7CG7ATG 
6220 
ACAGC7A7GA 
6270 



5680 
CCACCA7CAA 

5930 
77GC7GCAAC 

5960 
CG7C7CGC7G 

6020 
CC7C7CCCCG 

6080 
TCCCGACTGG 

6130 
7CACTCA77A 

6180 
77G7G7GGAA 

6230 
CCA7GA77AC 

6280 



G7777A7C77 
4990 
AGGGC7A7CA 
5040 
C7G7GCCACG 
5090 
GGCCAGAA7G 
5140 
7G7AAA7AA7 
5190 
TGAGCG77TT 
5240 
ACCAGCAAGG 
5290 
TACTAATCAA 
5340 
C7C7TTTACT 
5390 
GGCGTACCG7 
5440 
CCGC7C7GA7 
5490 
CCA7AG7ACG 
5540 
7ACGCGCAGC 
5590 
7CGC777C7T 
5640 
GC7C7AAA7C 
5690 
CC7CGACCCC 
5740 
CGCCC7GA7A 
5790 
AA7AG7GGAC 
5840 
C7A77CTTT7 
5890 
ACAGGA7T77 
5940 
7C7C7CAGGG 
5990 
G7GAAAAGAA 
6040 
CGCG77GGCC 
6090 
AAAGCGGGCA 
6140 
GGCACCCCAG 
6190 
77G7GAGCGG 
6240 
GAA77CGAGC 
6290 



C7GC7GG7GG 
5000 
GT7CGCGCA7 
5050 
7ATTC7TACG 
5100 
7CCCT77TA7 
5150 
CCA77TCAGA 
5200 
7CC7G77GCA 
5250 
CCGA7AG777 
5300 
AGAAGTA77G 
5350 
CGG7GGCC7C 
5400 
7CC7G7C7AA 
5450 
7CCAACGAGG 
5500 
CGCCC7G7AG 
5550 
G7GACCGC7A 
5600 
CCC77CC777 
5650 
GGGGGC7CCC 
5700 
AAAAAAC77G 
5750 
GACGGX7777 
*5800 
7C77G77CCA 
5850 
GA777ATAAG 
5900 
CGCC7GC7GG 
5950 
CCAGGCGG7G 
6000 
AAACCACCC7 
6050 
GA77CA7TAA 
6100 
G7GAGCGCAA 
6150 
GC77TACAC7 
6200 
A7AACAA777 
6250 
7CGCCCGGGG 
6300 
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riGC.«£. 9 (continued) 



ATCTGCCTGA 
6310 

ACGATTAACA 
63 60 

CCCGTTCAAC 
6410 

AGTTGGCCCT 



6460 
AAGATGTTTG 
6510 
CGCCAAGCCT 
6560 
ACGACTGGTT 
6610 
TTCCAGTTCC 
6660 
TAAGACCACT 
6710 
CTGTAGCAAG 
6760 
CGTATCCG7G 
6810 
ACTCAACAAG 
6860 
TCGAGGCTGA 
6910 
TCGrGGCATA 



ATAGGTACGA 
6320 
TCGCTAAGAA 
6370 
ACTCTGGCTG 
6420 
TGAGCATGAG 
6470 
AGCGTCAACT 
6520 
CTCATCACTA 
6570 
TGAGGAAGTG 
6620 
TGCAAGAAAT 
6670 
CTGGC7TGCC 
6720 
CGCAATCGGT 
6770 
ACC7TGAAGC 
6820 
CGCGTAGGGC 
6870 
CATGC7CTCT 
6920 



TTTACTAACr 
6330 

CGACTTCTCr 
6350 

ACCATTAC GG 
6430 

TCTTACGAGA 



6560 
GCTCATTGAG 
7010 
TAGTAGGTCA 
7060 
GCTATCGCAA 
7110 
ACCTTGCGTA 
7160 
ATTGGGCTAA 
7210 
AAAGCACTGA 
72 60 
GATTAACATT 
7310 
CGG7CGCCAA 
7360 
CC7GCGA7TG 
7410 
GAA7CC7GAG 
74 60 
GCAAGGACAA 
7510 
GAGCAAGCCA 
7560 
CA7GGAC7GG 
7610 



AGGAAGAC7C 
6970 
TCAACCGGAA 
7020 
AGAC7C7GAG 
7070 
CCCG7GCAGG 
7120 
G7TCC7CC7A 
7170 
CGG7CGTCG7 
7220 
TGCGC7ACGA 
7270 
GCGCAAAACA 
7320 
CG7AA7CACC 
7370 
AGCG7GAAGA 
7420 
GC7C7CACCG 
7470 
GGC7CGCAAG 
7520 
A7AAG777GC 
7570 
CGCGG7CG7G 
7620 



6480 
7AAAGC7GG7 
6530 
CCC7AC7CCC 
6580 
AAAGCTAAGC 
6630 
CAAGCCGGAA 
6680 
7AACCAG7GC 
6730 
CGGGCCA77G 
6780 
7AAGCAC77C 
6830 
ACG7C7ACAA 
6880 
AAGGG7C7AC 
6930 
7A77CATG7A 
6S60 
TGG7TAGC77 
7030 
AC7ATCGAAC 
7080 
7GCGC7GGC7 
:. 7130 
AGCCG7GGAC 
7180 
CC7C7GGCGC 
7230 
AGACG777AC 
7280 
CCGCA7GGAA 
7330 
AAG7GGAAGC 
7360 
AC7CCCGA7G 
7430 
CG7GGAAACG 
7460 
7C7CGCCG7A 
7530 
TAACCA7AAG 
7580 
777ACGC7G7 
7630 



GGAAGAGGCA 
6340 
GACA7CGAAC 
6390 
7GAGCG777A 
6440 
7GGG7GAAGC 
6490 
GAGG77GCGG 
6540 
7AAGA7GA77 
6590 
GCGGCAAGCG 
6640 
GCCG7AGCG7 
6690 
7GACAA7ACA 
6740 
AGGACGAGGC 
6790 
AAGAAAAACG 
6840 
GAAAGCA777 
6890 
7CGG7GGCGA 
6940 
GGAG7ACGC7 
6990 
ACACCGCCAA 
7040 
7CGCACC7GA 
7090 
GGCA7C7C7C 
7140 
7GGCA77AC7 
7190 
7GG7GCG7AC 
7240 
A7GCC7GAGG 
7290 



AA7CAACAAG 
7340 
A77G7CCGG7 
7390 
AAACCGGAAG 
7440 
TGC7GCCGC7 



7490 
TCAGCC7rGA 
7540 
GCCA7C7GG7 
7590 
G7CAA7G77C 
7640 



C7AAA7GA.=.C 
6350 
7GGC7GC7A7 
6400 
GC7CGCGAAC 
6450 
ACGC77CCGC 
65C0 
A7AACGC7GC 
6550 
GCACGCA7CA 
6600 
CCCGACAGCC 
6650 
ACA7CACCA7 
6700 
ACCG77CAGG 
6750 
7CGC77CGG7 
6800 
T7GAGGAACA 
6850 
A7GCAAG77G 
6900 
GGCG7GG7C7 
6550 
GCA7CGAGA7 
7000 
AA7GC7GGCG 
7050 
A7ACGC7GAG 
7100 
CGA7G7.7CCA 
7150 
GG7GG7GGC7 
7200 
rCACAG7AAG 
7250 
'rG7ACAAAGC 
7300 
AAAG7CC7AG 
7350 
CGAGGACA7C 
74CC 
ACA7CGACA7 

GC7G7G7ACC 

7500 
G77CA7GC77 

7550 
TCCC77AC;^J^ 

7600 
'vrtCCCGCAAG 

7 650 
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FIGURE 9 (continued) 



GTAACGATAT 

7660 
GGTAAGGAAG 

7710 
TGTCGATAAG 

7760 
ACGAGAACAT 

7810 



GACCAAAGGA 
7670 
GTTACTACTG 
7720 
GTTCCGTTCC 
7770 
CATGGCTTGC 
7820 



CTGC7TACGC 
76S0 
GCTGAAAATC 
7730 
CTGAGCGCAT 
7780 
GCTAAGTCTC 
7830 



TGGCGAAAGG 
76S0 
CACGGTGCAA 
7740 
CAAGTTCA7T 
77 90 
CACTGGAGAA 
7840 



TAAACC 

77C0 
ACTGTGCGGG 
7750 
GAGGAAAACC 
7800 
CAC7TGGTGG 
7850 



GCT«=C«= .TTCTCCCTT CTGCTTCCTT OCO^cilli „0«lI?L2 



7860 
TGGGGTACAG 
7910 
TTGACGGGTC 
7960 
GAGGTAGGTG 
8010 
CATCTACGGG 
8060 



7870 
CACCACGGCC 
7920 
TTGCTCTGGC 
7970 
GTCGCGCGG7 
8020 



7890 
CTGCTCCCTT 
7940 
TCTCCGCGAT 
7990 
CCTAGTGAAA 
8040 



7900 
CCGCTGGCGT 
7950 
GCTCCGAGAT 
8000 
CCG7TCAGGA 
8050 



7880 
7GAGC7A7AA 
7930 
A7CCAGCAC7 
7960 
7AAC77GC77 

8030 ounu «fi.^n 

ATTOTTGCTa »GAMGTC« CGMAITCTA CAAGCAGACG 
CAAXCAATCG GACCGA.AAC CAAGTA CCGTGaS .CAGAA^ic"? 

GGXGAAATCT C.GAGAAAGT CAAGCXGG ACTAAG .IJc" XGGC.G^^^^ 

AXGGCTGGGT TACGGTGTXA CTCGGA ..CXAaI"? „AGXcI?S° 

o^^u 82o0 8240 poca 

CGGGTCCAAA GAGTTCGGCT TCCGTCAACA AGTCCTGclj 

8230 



8210 
CGC7GGC77A 
8260 
GA7ACCA77C 
8310 
GCCGAA7CAG 
8360 
GCG7GACGG7 
8410 
GC7AAGC7GC 
8460 
7CGCAAGCG7 
8510 
GGCAGGAA7A 
8560 
GG7CAG77CC 
8610 
7GA7GCACAC 
8660 
AAGACGG7AG 
8710 
GGAA7CGAA7 
8760 
TGACGC7GCG 
8810 
A7GAG7CT7G 
8860 
7TGCACGAG7 
8910 
C7rGAACCrC 
8960 



8270 
AGCCAGC7A7 

8320 
GC7GC7GGA7 

8370 
GG7AGC7GCG 

8420 
7GGC7GC7GA 

8470 
7GCGC7G7GC 

8520 



CAAGAAGCC7 
8570 
GC77ACAGCC 
8620 
AAACAGGAG7 
8670 
CCACC77CG7 
8720 
C7777GCAC7 
8770 
AACC7G77CA 
8820 
7GA7GTAC7G 
8870 

8920 
CG7GACA7C7 
8970 



7GA77CCGGC 
8330 
ACA7GGC7AA 
8380 
G77GAAGCAA 
8430 
GG7CAAAGA7 
:. 84 80 
A77GGG7AAC 
8530 
A7TCAGACGC 
8560 
7ACCA77AAC 
8630 
C7GG7A7CGC 
8680 
AAGAC7G7AG 
8730 
GA77CACGAC 
8780 
AAGCAG7GCG 
8830 
GC7GA777C7 
8660 
CAAAA7GCCA 
8930 
TAGAG7CGGA 
8980 



8290 
AAGGG7C7GA 
8340 
GC7GA777GG 
8390 
7GAAC7GGC7 
8440 
AAGAAGAC7G 
8490 
7CC7GATGG7 
8540 
GC77GAACC7 
8590 
ACCAACAAAG 
6640 
7CC7AAC777 
8690 
7G7GGGCACA 
6740 
TCC77CGG7A 
8790 
CGAAAC7A7G 
8840 
ACGACCAG77 
6890 
GCAC77CCGG 
6940 
C77CGCG7TC 
6990 



8300 
7G77CAC7CA 
8350 
GAA7C7G7GA 
8400 
7AAG7C7GCT 
8450 
GAGAGA77C7 
8500 
77CCC7G7G7 
8550 
GA7G77CC7C 
8600 
A7AGCGAGAT 
8650 
G7ACACAGCC 
8700 
CGAGAAG7AC 
8750 
CCA A 7CCGGC 
8800 
G77GACACAT 



8850 



vjACCAG 
8900 
C7AAAGG7AA 
6950 
GCGTAACGCC 
90CC 
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FIGURE 9 (continued) 

-^--J- ««=c=-tc CC.C«CCJC c.=ccc«o= x.ooc.c.== 

!TCGT GACTGGGAAA ACCCTGGCG7 
9070 9080 9090 



CCOTCGTTTT .C«CCT«T «CT==s£ XCCCTG^?^? T.CCCaI?!? 
==CCC=„CC .„C==CCTT CCC«C g =c=X..P^S «X.C=c'il? 



92 60 9270 qoda ^'^^^"^^'-^^'^ ^aaaltggca 

«XO«g=T T.«„OC.C CC«Ct|| c«C=xI^»2 .^.^l^OO 

«.xc«xcc ccc.«|| .„c..g^° 

CXC.C.TX.J „^ ' ^ „^^^|3|0 ^_^^^^_^.^o 

mrrxTOAT ==cottc=tj txc.ttI1| ,„^1^ 
Axxx«c.c« «xxxx«c. «.x„|I cxxx^clil^? 
cxx„.c«x cxxccxcxxx xx<K.«| ;.cc=.ll?^ 
CAx«^xxc «.x=cx.sx xxx.col|? ccc=xxcl?l? .„cx=ll^° 
xxccxcc.^ cxcxcsccj „«cc|^ «„xx||^ c«cxclc'l^ 
«.xx^.= CCXCXCC30C „c«x I c^cxJI^ 
CAX.xxc.xc oxG«xxc« xcxcxcc' cxxxcxc'I^? cxxxxclIL^ 
XXX.CCXJCJ c.xx.cxc« CC.XXC ^ xMMxI^I? c«ccxl??? 



9910 9920 QoVn v,c-.v..GAGGC 

XXX.XXCCXX «xxxx=CT. „xcxxl^?? xxcccxal^; „xxxJ^a2 

ATGTT 
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FIGURE 10 




(interrupted 
byT7 01O) 
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